Abstract



Linear receivers are an attractive low-complexity alternative to optimal processing for multi-antenna
(MIMO) communications. In this paper we characterize the information-theoretic performance of MIMO
linear receivers in two different asymptotic regimes. For a fixed number of antennas, we investigate the
limit of error probability in the high-SNR regime in terms of the Diversity-Multiplexing Tradeoff (DMT).
Following this, we characterize the error probability for fixed SNR in the regime of a large (but finite)
number of antennas.

As far as the DMT is concerned, we report a negative result: we show that both linear Zero-Forcing 
(ZF) and linear Minimum Mean-Square Error (MMSE) receivers achieve the same DMT, which is largely 
suboptimal even in the case where outer coding and decoding is performed across the antennas. We 
also provide an approximate quantitative analysis of the markedly different behavior of the MMSE and 
ZF receivers at finite rate and non-asymptotic SNR, and show that while the ZF receiver achieves poor 
diversity at any finite rate, the MMSE receiver error curve slope flattens out progressively, as the coding 
rate increases. 

When SNR is fixed and the number of antennas becomes large, we show that the mutual information
at the output of an MMSE or ZF linear receiver has fluctuations that converge in distribution to a Gaussian
random variable, whose mean and variance can be characterized in closed form. This analysis extends to
the linear receiver case a well-known result previously obtained for the optimal receiver. Simulations reveal
that the asymptotic analysis accurately captures the outage behavior of systems even with a moderate
number of antennas.
1 Introduction



The next generation of wireless communication systems is expected to capitalize on the large gains in spectral 
efficiency and reliability promised by MIMO multi-antenna communications [H 13 [THl [H] and include MIMO 
technology as a fundamental component of their physical layer [1] . The information theoretic analysis and 
the efficient design of space-time (ST) codes for transmission over these MIMO systems have been active 
areas of research over the past decade. Suboptimal receiver schemes have also been widely
proposed and investigated as low-complexity alternatives to the optimal Maximum-Likelihood (ML) or
ML-like receivers [10], [11]. These schemes range from iterative (soft) interference cancellation (e.g.,
[7]), to successive (hard) interference cancellation (e.g., [5], [6]), to the even lower complexity "separated"
architecture, based on linear spatial equalization followed by standard single-input single-output (SISO)
decoding.^1

In this paper, we present two types of asymptotic performance analysis of this low-complexity MIMO 
architecture. First, we consider the Diversity-Multiplexing Tradeoff (DMT) [3], which captures the performance
tradeoff between rate and block-error probability in the high-SNR, high spectral efficiency regime.
We determine the DMT achieved by low-complexity MIMO architectures that use Zero-Forcing (ZF) or 
Minimum Mean-Square Error (MMSE) linear receivers and apply conventional SISO outer coding before 
the MIMO transmitter and conventional SISO decoding to the output of the linear receiver. The DMT 
analysis reveals that both ZF and MMSE linear receivers are very suboptimal in terms of their achievable 
diversity. Furthermore, we observe that while the DMT analysis accurately predicts the behavior of the ZF 
receiver at all finite rates, the performance of the MMSE receiver is in stark contrast to that predicted by 
the DMT analysis at low rates. In fact, we observe that for sufficiently low rates the MMSE receiver exhibits 
an ML-like performance. On the contrary, when working at higher rates (and correspondingly higher SNR) 
the MMSE receiver approaches the ZF performance. We provide an approximate analysis that explains this 
behavior both qualitatively and quantitatively. 

In the second part of this paper we take a closer look at the performance of the linear MMSE and ZF 
receivers at finite SNR. Since this is very difficult to capture in closed form, we explore a second type of 
asymptotic regime, where we fix SNR and let the number of antennas become large. Using random matrix 
theory, we show that in this case the limiting distribution of the mutual information of the parallel channels 
induced by the linear receiver is Gaussian, with mean and variance that can be computed in closed form. 
The analysis provides accurate results even for a moderate number of antennas and allows us to quantify how
the performance loss in terms of diversity suffered by linear receivers may be recovered by increasing the
number of antennas. This leads to the conclusion that, in order to achieve a desired target spectral
efficiency and block-error rate at given SNR and receiver complexity, increasing the number of antennas
and using simple linear receiver processing may in fact be a good design option.

^1 It should be noted that the current MIMO WLAN standard [1] is based on MIMO-OFDM; therefore, linear equalization
is performed in the space and in the frequency domains. For simplicity, in this work we restrict ourselves to the standard
frequency-flat case where equalization is purely spatial.

The paper is organized as follows. In the rest of this section we briefly comment on the related
literature. In Section 2 we define the system model and recall the main facts about the ZF and MMSE linear
receivers considered in this work. Section 3 presents the DMT analysis and some illustrative numerical
examples. Section 4 is devoted to the fixed-rate analysis of the MMSE receiver performance with coding
across the antennas and provides an approximate quantitative analysis of the slope of the error probability
versus SNR. Section 5 deals with the limiting distribution of the mutual information for the MMSE and ZF
receivers for a large number of antennas and provides some illustrative numerical examples on the validity
and limitations of this analysis. Conclusions are drawn in Section 6 and some technical details of the
proofs are deferred to the Appendix.



1.1 Related literature 

Since its introduction in the seminal work [3], the DMT has become a standard tool in the characterization of
the performance of slowly-varying fading channels in the high-SNR, large spectral efficiency regime. Space-time
coding schemes have been characterized in terms of their achievable DMT in a series of works, including
lattice coding and decoding [16] and ZF or MMSE decision feedback receivers (see for example [17], [18]). The
multipath diversity achievable by linear equalizers in frequency-selective SISO channels has also attracted
some attention and was recently solved in [19]. The spatial diversity achievable by MIMO linear receivers
and separated detection and decoding was investigated in parallel and independently in [22] by the authors^2
and in [20]. In this respect, it is worthwhile to stress the differences between the present work and [20]:
1) we investigate the full DMT curve, while [20] focuses only on the fixed-rate case (corresponding to zero
multiplexing gain); 2) [20] develops only lower bounds to the diversity order, based on upper bounds on the
outage probability, while we have both lower and upper bounds and show that they are tight; 3) the analysis
of the diversity order of the ZF receiver in [20] is fundamentally flawed for the case of coding across the
antennas. In fact, [20] conjectures that the channel gains in the parallel channels induced by the ZF receiver
are statistically independent. If this were the case, the diversity order would be very different, as detailed in
a comment at the end of Section 3.1. Indeed, the final result in [20] is correct because of a compensation of
errors. In contrast, we show that the channel gains are strongly correlated, and this is precisely why coding
across the antennas does not buy any extra diversity with respect to pure spatial multiplexing; 4) in [20] the
diversity of the MMSE receiver with coding across the antennas is characterized in the region of low rates
and high rates for the case of two transmit antennas. In contrast, the approximate analysis presented in
Section 4 of this paper characterizes the diversity of the MMSE receiver for the whole range of intermediate
rates from "low" to "high" and for an arbitrary number of antennas.

^2 The present paper provides the detailed proofs of the DMT results presented in [22] and presents the novel large-system
finite-SNR analysis of the MMSE receiver, which is not given in [22].

With respect to the large-system analysis of linear receivers presented in Section 5, we notice that
asymptotic Gaussianity was shown for the MIMO channel mutual information given by the "log-det" formula,
whose cumulative distribution function (cdf) yields the block-error rate achievable under optimal decoding.
This was shown in various works, such as [23] [Ml [^ . At the same time, the marginal asymptotic
Gaussianity of the SINR of a single MMSE and ZF receiver channel was derived in [27], [28], without looking
at the joint Gaussianity of all SINRs for all these channels. While the marginal Gaussianity is useful in 
the case of pure spatial multiplexing, where each antenna (or "spatial stream" ) is independently encoded 
and decoded, we would like to remark here that the joint Gaussianity is crucial in the analysis of the most 
relevant case where outer coding is applied across antennas. In Section 5 we characterize the limiting joint
Gaussian distribution of the SINRs and obtain the statistics of the mutual information of linear MMSE and 
ZF receivers for the case of coding across the antennas. Our approach is novel and does not follow as a 
simple extension of the analysis of the marginal statistics as done previously. 

2 System model, DMT and linear receivers 

Fig. 1 shows three types of MIMO architectures, employing M transmit and N receive antennas. Since the
focus of this paper is on linear receivers, we shall assume $N \ge M$ throughout. Scheme (a) puts
no restriction on the choice of the space-time coding and decoding scheme: the M channel inputs are jointly
encoded, and the N channel outputs are jointly and possibly optimally decoded. Scheme (b) is based on
interleaving and demultiplexing the codewords of a SISO code over the M inputs. A linear spatial equalizer
(referred to briefly as "linear receiver" in the following) processes each N-dimensional channel output vector
(purely spatial processing) and creates M virtual approximately parallel channels (details are given later on).
The outputs of these virtual channels are then demultiplexed and deinterleaved, and eventually fed to a SISO
decoder that treats them as scalar observations, thus disregarding the possible dependencies introduced by
the underlying MIMO channel. Notice that in scheme (b) coding is applied across the antennas. Finally,
scheme (c) is based solely on "spatial multiplexing", that is, M independently encoded streams drive the
M transmit antennas and are approximately separated by the linear receiver, the outputs of which are fed
to M independent decoders.

The output of the underlying frequency-flat slowly-varying MIMO channel is given by

$$ y_t = H x_t + w_t, \quad t = 1, \ldots, T, \qquad (1) $$

where $x_t \in \mathbb{C}^M$ denotes the channel input vector at channel use t, $w_t \sim \mathcal{CN}(0, N_0 I)$ is the additive spatially
and temporally white Gaussian noise and $H \in \mathbb{C}^{N \times M}$ is the channel matrix. In this work we make the
standard assumption that the entries of H are i.i.d. $\sim \mathcal{CN}(0, 1)$, and that H is random but constant over
the duration T of a codeword (quasi-static Rayleigh i.i.d. fading [H E]). The input is subject to the total
power constraint

$$ \frac{1}{MT} E\left[ \|X\|_F^2 \right] \le E_s, \qquad (2) $$

where $X = [x_1, \ldots, x_T]$ denotes a space-time codeword, uniformly distributed over the space-time codebook
$\mathcal{X}$, and $\|\cdot\|_F$ denotes the Frobenius norm. Furthermore, following the standard literature of MIMO channels
and space-time coding, we define the transmit SNR $\rho$ as the total transmit energy per time-slot over the
noise power spectral density, i.e., $\rho = M E_s / N_0$.

We assume no Channel State Information (CSI) at the transmitter. In this work we consider the case of 
very large block length (and consequently of very slowly-varying fading). Under the quasi-static assumption,
it is well-known that the capacity and the outage capacity (or ε-capacity) are independent of the assumption
on CSI at the receiver [9]. Hence, assuming perfect CSI at the receiver incurs no loss of generality.

We focus on the MIMO detector/decoder blocks in Fig. 1. Under the fully unconstrained ST architecture
(a), the optimum receiver for the MIMO channel in (1) is the maximum likelihood (ML) decoder, with
minimum distance decision rule given by

$$ \hat{X} = \arg\min_{X \in \mathcal{X}} \|Y - HX\|_F^2, $$

where $Y = [y_1, \ldots, y_T]$ collects the channel outputs over the block.
This entails joint processing of the symbols across all antennas at the receiver, over the whole block length
T, and is typically implemented using algorithms like Sphere Decoding (see [10] and references therein) and
their tree search sequential decoding generalization [11], possibly coupled with the ML Viterbi algorithm if the
underlying code has a trellis structure (e.g., [30], [31]). The performance of this decoder is characterized by
the information outage probability given by

$$ P_{\rm out}(R, \rho) = \inf_{S:\ S \succeq 0,\ {\rm tr}(S) \le 1} P\left( \log\det\left(I + \rho H S H^H\right) \le R \right), \qquad (3) $$

where the optimization is over the Hermitian symmetric non-negative definite matrix S subject to a trace
constraint, reflecting the channel input power constraint (2). Several lower complexity suboptimal decoders
have been proposed in the literature. In particular, architectures (b) and (c) in Fig. 1 involve a linear
memoryless receiver defined by the matrix G, such that the output of the linear receiver is $y'_t = G y_t$.
Classical choices for G are the ZF or the MMSE spatial filters, or any diagonal scaling thereof. Under the
assumption of Gaussian inputs, very large block length T and ideal interleaving, the linear receiver creates
M "virtual" parallel channels that, without loss of generality, can be described by

$$ y'_{k,t} = \sqrt{\gamma_k}\, x_{k,t} + w'_{k,t}, \quad k = 1, \ldots, M, \qquad (4) $$

where we normalize the input and output such that $E[|x_{k,t}|^2] = E[|w'_{k,t}|^2] = 1$, and where $\gamma_k$ denotes the
Signal to Interference plus Noise Ratio (SINR) at the k-th linear receiver output.^3

Figure 1: Three possible space-time architectures: (a) unrestricted space-time coding scheme; (b) coding
across the antennas, with linear spatial equalization; (c) pure spatial multiplexing with linear spatial
equalization. $\pi$ and $\pi^{-1}$ in (b) denote interleaving and de-interleaving.

Under the above assumptions, the performance of such schemes is characterized by the following two
outage probabilities. With coding across antennas (scheme (b)), the outage probability of interest is given
by

$$ P_{\rm out}^{\rm ca}(R, \rho) \triangleq P\left( \sum_{k=1}^{M} \log(1 + \gamma_k) \le R \right). \qquad (5) $$

Under pure spatial multiplexing (scheme (c)), the relevant outage probability is given by

$$ P_{\rm out}^{\rm sm}(R, \rho) = P\left( \bigcup_{k=1}^{M} \left\{ \log(1 + \gamma_k) \le \frac{R}{M} \right\} \right), \qquad (6) $$

where we used the fact that, by symmetry, without CSI at the transmitter the optimal performance of
spatial multiplexing with linear receivers is achieved by allocating the same rate R/M to each stream.
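
To make the two outage events in (5) and (6) concrete, the following minimal sketch (an illustration, not part of the original analysis) evaluates them for a single channel realization. It assumes rates in bits per channel use, so logarithms are taken to base 2; the function name `outage_events` and the sample SINR values are hypothetical.

```python
import math

def outage_events(gammas, R):
    """Return (coded, multiplexed) outage indicators for one realization.

    coded       : event in (5), sum_k log2(1 + gamma_k) <= R
    multiplexed : event in (6), some stream has log2(1 + gamma_k) <= R / M
    """
    M = len(gammas)
    stream_rates = [math.log2(1.0 + g) for g in gammas]
    coded = sum(stream_rates) <= R
    multiplexed = any(rate <= R / M for rate in stream_rates)
    return coded, multiplexed

# One strong and one weak stream (hypothetical SINRs) at R = 4 bpcu.
example = outage_events([15.0, 0.5], R=4.0)
```

Whenever the first indicator is true the second is true as well, since a sum of M terms not exceeding R forces at least one term not to exceed R/M. Averaged over channel realizations, this is why the probability in (5) never exceeds the one in (6) for the same set of SINRs.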

For completeness and for later use, we recall here the expressions of the SINRs for the ZF and the MMSE 
linear receivers. 

ZF receiver. In this case, the matrix G is chosen as $G = D H^+$, where D is a suitable diagonal scaling
matrix and $H^+$ is the Moore-Penrose pseudo-inverse of H [12]. Since H has rank M with probability 1, this
takes on the form

$$ H^+ = (H^H H)^{-1} H^H. $$

In the absence of transmitter CSI, the signal power is allocated uniformly across the transmit antennas.
It is immediate to show that the SINRs on the resulting M parallel channels are given by

$$ \gamma_k = \frac{\rho / M}{\left[ (H^H H)^{-1} \right]_{kk}}, \qquad (7) $$

where the notation $[A]_{kk}$ indicates the k-th diagonal entry of a matrix A.
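
As a minimal numerical sketch of (7), assuming NumPy and an i.i.d. $\mathcal{CN}(0,1)$ channel draw (the function name `zf_sinrs`, the seed and the dimensions are illustrative choices, not taken from the paper), the ZF SINRs can be read off the diagonal of $(H^H H)^{-1}$:

```python
import numpy as np

def zf_sinrs(H, rho):
    """ZF SINRs of eq. (7): gamma_k = (rho / M) / [(H^H H)^{-1}]_kk."""
    M = H.shape[1]
    gram_inv = np.linalg.inv(H.conj().T @ H)
    # The diagonal of the inverse Gram matrix is real and positive for full-rank H.
    return (rho / M) / np.real(np.diag(gram_inv))

rng = np.random.default_rng(0)
N, M, rho = 4, 2, 10.0  # hypothetical dimensions and SNR
H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)
gamma_zf = zf_sinrs(H, rho)
```

For a channel with orthogonal columns the inverse Gram matrix is diagonal, so each SINR reduces to $(\rho/M)\|h_k\|^2$, which gives a quick sanity check of the routine.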

MMSE receiver. In this case, the matrix G is chosen in order to maximize the SINR $\gamma_k$ for each k, over
all linear receivers. It is well-known that this is achieved by choosing $G = D H_{\rm mmse}$, where D is a suitable
diagonal scaling matrix and $H_{\rm mmse}$ is the linear MMSE filter [12] that minimizes the MSE $E[\|x_t - H_{\rm mmse}\, y_t\|^2]$.
Using the orthogonality principle, we find

$$ H_{\rm mmse} = \frac{\rho}{M} H^H \left[ I + \frac{\rho}{M} H H^H \right]^{-1}. \qquad (8) $$

A standard calculation [12] yields the SINRs $\gamma_k$ of the resulting set of virtual parallel channels in the form

$$ \gamma_k = \frac{\rho}{M}\, h_k^H \left[ I + \frac{\rho}{M} H_k H_k^H \right]^{-1} h_k = \frac{1}{\left[ \left( I + \frac{\rho}{M} H^H H \right)^{-1} \right]_{kk}} - 1, \qquad (9) $$

where $H_k$ denotes the $N \times (M-1)$ matrix obtained by removing the k-th column, $h_k$, from H.

^3 In order to avoid any misunderstanding, it should be noted that "interference" is caused solely by the generally
imperfect separation of the transmitted symbols in $x_t$ by the linear receiver G. We consider a strictly single-user setting, with
no multiuser interference.
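
The two expressions for the MMSE SINR in (9) can be compared numerically. The sketch below assumes NumPy and a randomly drawn i.i.d. Rayleigh channel; the helper names `mmse_sinrs` and `mmse_sinr_direct`, the seed and the dimensions are illustrative assumptions rather than anything prescribed by the paper.

```python
import numpy as np

def mmse_sinrs(H, rho):
    """Second form of (9): 1 / [(I + (rho/M) H^H H)^{-1}]_kk - 1, for all k."""
    M = H.shape[1]
    A = np.linalg.inv(np.eye(M) + (rho / M) * (H.conj().T @ H))
    return 1.0 / np.real(np.diag(A)) - 1.0

def mmse_sinr_direct(H, rho, k):
    """First form of (9): (rho/M) h_k^H (I + (rho/M) H_k H_k^H)^{-1} h_k."""
    N, M = H.shape
    h_k = H[:, k]
    H_k = np.delete(H, k, axis=1)  # channel matrix with the k-th column removed
    B = np.eye(N) + (rho / M) * (H_k @ H_k.conj().T)
    return np.real((rho / M) * (h_k.conj() @ np.linalg.solve(B, h_k)))

rng = np.random.default_rng(1)
N, M, rho = 4, 3, 10.0  # hypothetical dimensions and SNR
H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)
gamma_mmse = mmse_sinrs(H, rho)
gamma_direct = np.array([mmse_sinr_direct(H, rho, k) for k in range(M)])
```

The second form needs a single $M \times M$ inverse for all streams, which is why it is the more convenient one when all M SINRs are required at once.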

3 Diversity-Multiplexing Tradeoff 

A compact and convenient characterization of the tradeoff between rate and block-error probability of MIMO
quasi-static fading channels in the high-SNR regime is provided by the DMT introduced in [3]. Consider a
family of space-time coding systems, each of which operates at SNR $\rho$ with rate $R(\rho)$ and error probability
$P_e(\rho)$. We say that this family achieves multiplexing gain r and diversity gain d (i.e., the point (r, d) on
the DMT plane) if

$$ \lim_{\rho \to \infty} \frac{R(\rho)}{\log \rho} = r, \qquad \lim_{\rho \to \infty} \frac{\log P_e(\rho)}{\log \rho} = -d. $$

The latter relation is written briefly as $P_e(\rho) \doteq \rho^{-d}$ in the exponential equality notation of [3].

The optimal DMT is the best possible error probability exponent $d^*(r)$ achievable by any space-time
scheme at multiplexing gain r. The standard theory of ε-capacity [8] readily yields that $d^*(r)$ is equal to
the negative $\rho$-exponent of the information outage probability [3]. For the space-time channel in (1), $d^*(r)$
is given by the piecewise linear function interpolating the points (r, d) with coordinates

$$ r = k, \qquad d = (M - k)(N - k) $$

for $k = 0, 1, \ldots, \min\{M, N\}$, and is zero for $r > \min\{M, N\}$ [3].
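
The piecewise-linear interpolation that defines the optimal DMT can be written down directly. The following is a minimal sketch using only the Python standard library; the function name `optimal_dmt` is an illustrative choice.

```python
import math

def optimal_dmt(r, M, N):
    """Optimal DMT d*(r): piecewise linear through (k, (M-k)(N-k)), k = 0..min(M, N)."""
    if r < 0:
        raise ValueError("multiplexing gain must be non-negative")
    m = min(M, N)
    if r >= m:
        return 0.0
    k = math.floor(r)                    # left corner point of the current segment
    d_left = (M - k) * (N - k)
    d_right = (M - k - 1) * (N - k - 1)
    return d_left + (r - k) * (d_right - d_left)

# Corner points and one interior point of the 2x2 tradeoff curve.
curve_2x2 = [optimal_dmt(r, 2, 2) for r in (0, 0.5, 1, 2)]
```

For M = N = 2 the corner points are (0, 4), (1, 1) and (2, 0), so the value at r = 0.5 lies halfway between 4 and 1.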

While $d^*(r)$ is achievable under the optimal receiver (a) in Fig. 1, the following result characterizes the
DMT of the MIMO channel in (1) under schemes (b) and (c), when the linear receiver is either the ZF or
the MMSE receiver defined above:

Theorem 1 The DMT of the M-transmit, N-receive i.i.d. Rayleigh MIMO channel with $N \ge M$, constrained
to use Gaussian codes under either MMSE or ZF linear receivers, is given by^4

$$ d^*_{\rm lin}(r) = (N - M + 1) \left( 1 - \frac{r}{M} \right)^+, \qquad (10) $$

for both the cases of coding across antennas or pure spatial multiplexing.

^4 Note: $(x)^+ = \max\{x, 0\}$.
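
As a small illustration of Theorem 1, the sketch below evaluates the linear-receiver tradeoff (10) and prints it next to the optimal diversity MN at zero multiplexing gain (the k = 0 corner point of the optimal DMT). The function name `linear_receiver_dmt` and the antenna configurations are illustrative assumptions.

```python
def linear_receiver_dmt(r, M, N):
    """DMT of eq. (10): (N - M + 1) * (1 - r/M)^+, stated for N >= M."""
    if N < M:
        raise ValueError("Theorem 1 assumes N >= M")
    return (N - M + 1) * max(1.0 - r / M, 0.0)

# Diversity at r = 0 for a few configurations, next to the optimal value M * N.
for M, N in [(2, 2), (2, 4), (4, 4)]:
    print(M, N, linear_receiver_dmt(0.0, M, N), M * N)
```

With M = N the linear receivers reach diversity 1 at r = 0 regardless of the number of antennas, whereas the optimal receiver reaches MN.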
Proof. The theorem is proved by developing upper and lower bounds on the outage probability
$P_{\rm out}^{\rm mmse}(R, \rho)$ of the MMSE receiver in configuration (b) of the block diagram of Fig. 1. A simple upper bound
on the outage probability for the ZF receiver immediately extends the result to this case. For configuration (c)
the result follows as an immediate corollary.

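Lower bound on the outage exponent. Let $\lambda_{\min}(A)$ and $\lambda_{\max}(A)$ denote the minimum and maximum
eigenvalues of a Hermitian symmetric matrix A, and let $\lambda_1 \le \lambda_2 \le \cdots \le \lambda_M$ denote the ordered
eigenvalues of the $M \times M$ Wishart matrix $H^H H$, with joint pdf given by [14]

$$ p(\lambda) = K_{M,N} \prod_{i=1}^{M} \lambda_i^{N-M} \prod_{i<j} (\lambda_i - \lambda_j)^2 \exp\left( -\sum_{i=1}^{M} \lambda_i \right), \qquad (11) $$

where $K_{M,N}$ is a normalization constant and we have assumed $M \le N$.

Using (9) we can write the mutual information with Gaussian coding across the antennas and the MMSE
receiver as

$$ I_{\rm mmse}(H) = \sum_{k=1}^{M} \log(1 + \gamma_k) = -\sum_{k=1}^{M} \log \left[ \left( I + \frac{\rho}{M} H^H H \right)^{-1} \right]_{kk}. \qquad (12) $$

Since the function $-\log(\cdot)$ is convex, using Jensen's inequality we have

$$ I_{\rm mmse}(H) \ge -M \log\left( \frac{1}{M} \sum_{k=1}^{M} \left[ \left( I + \frac{\rho}{M} H^H H \right)^{-1} \right]_{kk} \right) = -M \log\left( \frac{1}{M} \sum_{k=1}^{M} \frac{1}{1 + \frac{\rho}{M}\lambda_k} \right). $$

Using this bound in (5) we obtain

$$ P_{\rm out}^{\rm mmse}(R, \rho) \le P\left( -M \log\left( \frac{1}{M} \sum_{k=1}^{M} \frac{1}{1 + \frac{\rho}{M}\lambda_k} \right) \le R \right) $$
$$ = P\left( \sum_{k=1}^{M} \frac{1}{1 + \frac{\rho}{M}\lambda_k} \ge M 2^{-R/M} \right) $$
$$ = P\left( \sum_{k=1}^{M} \frac{1}{1 + \frac{\rho}{M}\lambda_k} \ge M \rho^{-r/M} \right), \qquad (13) $$

where in the last line we let $R = r \log \rho$. Finally, since $\lambda_1$ is the smallest eigenvalue, each term of the sum
is at most $1/(1 + \frac{\rho}{M}\lambda_1)$, and we can use the trivial asymptotic upper bound

$$ P\left( \sum_{k=1}^{M} \frac{1}{1 + \frac{\rho}{M}\lambda_k} \ge M \rho^{-r/M} \right) \le P\left( \frac{M}{1 + \frac{\rho}{M}\lambda_1} \ge M \rho^{-r/M} \right) \doteq P\left( \lambda_1 \le \rho^{-(1 - r/M)} \right). \qquad (14) $$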
First, we notice that the asymptotic outage probability upper bound in the RHS of (14) vanishes only if
$r/M < 1$. Hence, the outage exponent lower bound is zero for $r/M \ge 1$. When $r/M < 1$, we can write

$$ P\left( \lambda_1 \le \rho^{-(1 - r/M)} \right) = \int_0^{\rho^{-(1 - r/M)}} d\lambda_1 \int \prod_{i=2}^{M} d\lambda_i \; p(\lambda) = \int_0^{\rho^{-(1 - r/M)}} p_1(\lambda_1)\, d\lambda_1 \doteq \kappa_1\, \rho^{-(N - M + 1)(1 - r/M)}, \qquad (15) $$

where $\kappa_1$ is a constant and where we have used the well-known fact [4] that the marginal pdf of $\lambda_1 =
\lambda_{\min}(H^H H)$, denoted by $p_1(\lambda)$ in (15), satisfies $p_1(\lambda) \propto \lambda^{N-M}$ for small argument $\lambda \ll 1$. The resulting
outage exponent lower bound is

$$ d^*_{\rm lin}(r) \ge (N - M + 1) \left( 1 - \frac{r}{M} \right)^+. \qquad (16) $$



The same result can be obtained by following the by-now standard technique of [3] based on the change
of variable $\lambda_i = \rho^{-\alpha_i}$, integrating the resulting pdf of $\alpha_1, \ldots, \alpha_M$ over the outage region and applying
Varadhan's lemma [3].

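Upper bound on the outage exponent. Using the concavity and the monotonicity of the $\log(\cdot)$
function, we obtain from (12) and Jensen's inequality that

$$ I_{\rm mmse}(H) \le M \log\left( \frac{1}{M} \sum_{k=1}^{M} \frac{1}{\left[ \left( I + \frac{\rho}{M} H^H H \right)^{-1} \right]_{kk}} \right). \qquad (17) $$

Consider the decomposition $H^H H = U^H \Lambda U$, where U is unitary and $\Lambda$ is a diagonal matrix with the
eigenvalues of $H^H H$ on the diagonal. Defining $u_k$ to be the k-th column of U and $e_k$ to be the column vector
that has a one in the k-th component and zeros elsewhere, we have that

$$ \left[ \left( I + \frac{\rho}{M} H^H H \right)^{-1} \right]_{kk} = e_k^H U^H \left( I + \frac{\rho}{M}\Lambda \right)^{-1} U e_k = u_k^H \left( I + \frac{\rho}{M}\Lambda \right)^{-1} u_k = \sum_{\ell=1}^{M} \frac{|u_{\ell k}|^2}{1 + \frac{\rho}{M}\lambda_\ell}. $$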
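Hence, the term inside the logarithm in (17) can be upper bounded as

$$ \frac{1}{M} \sum_{k=1}^{M} \frac{1}{\left[ \left( I + \frac{\rho}{M} H^H H \right)^{-1} \right]_{kk}} = \frac{1}{M} \sum_{k=1}^{M} \left( \sum_{\ell=1}^{M} \frac{|u_{\ell k}|^2}{1 + \frac{\rho}{M}\lambda_\ell} \right)^{-1} \le \left( 1 + \frac{\rho}{M}\lambda_1 \right) \frac{1}{M} \sum_{k=1}^{M} \frac{1}{|u_{1k}|^2}. \qquad (18) $$

Let $\mathcal{A}$ denote the event $\left\{ \frac{1}{M} \sum_{k=1}^{M} \frac{1}{|u_{1k}|^2} \le c \right\}$, where c is some constant (independent of $\rho$). We have that

$$ P_{\rm out}^{\rm mmse}(R, \rho) \ge P(\mathcal{A})\, P\left( \frac{1}{M} I_{\rm mmse}(H) \le \frac{R}{M} \,\Big|\, \mathcal{A} \right) $$
$$ \ge P(\mathcal{A})\, P\left( \log\left( \left( 1 + \frac{\rho}{M}\lambda_1 \right) c \right) \le \frac{R}{M} \right) $$
$$ \doteq P\left( \log\left( 1 + \frac{\rho}{M}\lambda_1 \right) \le \frac{r}{M} \log \rho \right), \qquad (19) $$

where the second inequality uses (17), (18) and the independence of the eigenvalues and eigenvectors of $H^H H$,
and where the last exponential equality holds if $P(\mathcal{A})$ is an O(1) non-zero term, i.e., it is a constant with respect
to $\rho$ bounded away from zero. This is shown rigorously in Appendix A.

It is immediate to check that the last line of (19) is asymptotically equivalent to (14). Therefore, applying
the same argument as in (15), we find that the upper bound on the outage probability exponent coincides
with the previously found lower bound.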
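
The expansion of the diagonal entries used in the upper bound, and the resulting per-stream inequality behind (18), can be checked numerically. The sketch below assumes NumPy; it uses `numpy.linalg.eigh`, which returns $W = V\,\mathrm{diag}(\lambda)\,V^H$ with ascending eigenvalues, so the matrix U of the decomposition $H^H H = U^H \Lambda U$ corresponds to $V^H$. Seed, dimensions and variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
N, M, rho = 4, 3, 10.0  # hypothetical dimensions and SNR
H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)

W = H.conj().T @ H
lam, V = np.linalg.eigh(W)   # W = V diag(lam) V^H, lam[0] is the smallest eigenvalue
U = V.conj().T               # convention of the text: W = U^H diag(lam) U
s = rho / M

# Left-hand side: diagonal entries of (I + (rho/M) H^H H)^{-1}.
lhs = np.real(np.diag(np.linalg.inv(np.eye(M) + s * W)))

# Right-hand side: sum over l of |u_{lk}|^2 / (1 + (rho/M) lambda_l), for each k.
rhs = (np.abs(U) ** 2 / (1.0 + s * lam)[:, None]).sum(axis=0)

# Per-stream bound behind (18): keep only the l = 1 term of the sum.
per_stream_bound = (1.0 + s * lam[0]) / np.abs(U[0, :]) ** 2
```

Dropping all but the term associated with the smallest eigenvalue can only decrease each diagonal entry, which is what makes the reciprocal bounded by `per_stream_bound`.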

The proof of Theorem 1 is completed by observing that in the case of the ZF receiver a lower bound on
the SINR $\gamma_k$ is readily obtained from the inequality

$$ \left[ (H^H H)^{-1} \right]_{kk} \le \lambda_{\max}\left[ (H^H H)^{-1} \right] = \frac{1}{\lambda_{\min}(H^H H)} = \frac{1}{\lambda_1}, $$

that holds for all $k = 1, \ldots, M$. Using this in the mutual information expression for the ZF receiver with
coding across the antennas we obtain

$$ P_{\rm out}^{\rm zf}(R, \rho) \le P\left( \log\left( 1 + \frac{\rho}{M}\lambda_1 \right) \le \frac{r}{M} \log \rho \right). \qquad (20) $$

Noticing that (20) coincides with the asymptotic lower bound (19) for the MMSE receiver, and that the
MMSE receiver maximizes the mutual information over all linear receivers, under Gaussian inputs and the
system assumptions made here, we immediately obtain that the ZF receiver also achieves the outage exponent $d^*_{\rm lin}(r)$
given in (10).
Finally, as far as spatial multiplexing is concerned (no coding across the antennas), it is clear from (5)
and (6) that, for any linear receiver G, the outage probability (5) with coding across the antennas is no larger
than the outage probability (6) of pure spatial multiplexing. On the other hand, it is immediate to
show that spatial multiplexing achieves the same DMT (10). The details are straightforward and are therefore omitted. □



3.1 Discussion and numerical results 

Theorem 1 shows that, in terms of DMT, there is no advantage in using interleaving and coding across the
antennas when a linear receiver is used in order to spatially separate the transmitted symbols. In other
words, the linear receiver front-end kills the transmit diversity gain offered by the MIMO channel. In fact,
the DMT $(N - M + 1)(1 - r/M)^+$ of Theorem 1 has the following intuitive interpretation: it coincides
with the DMT of a SIMO (Single-Input, Multiple-Output) channel (receiver diversity only) with $N - M + 1$
receive antennas, used at a rate R/M.

This fact also shows that the channel gains of the virtual parallel channels are strongly statistically
dependent. For example, it is well-known that the ZF receiver applied to an $M \times N$ channel with $N \ge M$ and
i.i.d. Rayleigh fading yields channel gains $\gamma_k = \frac{\rho/M}{[(H^H H)^{-1}]_{kk}}$ that are marginally distributed as central
Chi-squared random variables with $2(N - M + 1)$ degrees of freedom [21]. If the gains $\gamma_1, \ldots, \gamma_M$ were statistically
independent, by coding across the antennas we would obtain the DMT of the parallel independent channels,
given by [T^]

$$ d^*_{\rm parallel,\, i.i.d.}(r) = (N - M + 1)(M - r)^+, $$

which is much larger than the DMT given by Theorem 1. In contrast, the channel gains in the regime
of high SNR are essentially dominated by the minimum eigenvalue of the matrix $H^H H$ and therefore are
strongly correlated: if one subchannel is in deep fade, they are all in deep fade with high probability. This is
the reason why coding across the transmit antennas does not buy any improvement in terms of DMT with
respect to simple spatial multiplexing.^5
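
The marginal Chi-squared law of the ZF gains can be probed with a short Monte Carlo experiment. The sketch below assumes NumPy and checks only the first moment: for i.i.d. $\mathcal{CN}(0,1)$ entries, the normalized gain $1/[(H^H H)^{-1}]_{kk}$ (that is, $\gamma_k$ divided by $\rho/M$) is a scaled Chi-squared variable with $2(N - M + 1)$ degrees of freedom and mean $N - M + 1$. The seed, the number of trials and the dimensions are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(3)
N, M, trials = 4, 2, 20000  # hypothetical dimensions and sample size

# Batch of i.i.d. CN(0, 1) channel matrices, shape (trials, N, M).
H = (rng.standard_normal((trials, N, M))
     + 1j * rng.standard_normal((trials, N, M))) / np.sqrt(2)

W = np.swapaxes(H.conj(), 1, 2) @ H              # Gram matrices H^H H, shape (trials, M, M)
inv_diag = np.real(np.diagonal(np.linalg.inv(W), axis1=1, axis2=2))
gains = 1.0 / inv_diag                            # normalized ZF gains, shape (trials, M)

mean_gain = gains.mean()                          # should be close to N - M + 1
```

This experiment says nothing about the joint behavior of the gains; it only illustrates the marginal statistics quoted above.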

Having said so, we should also remark that the picture for linear receivers is not as grim as it
may appear from the high-SNR DMT analysis. Indeed, coding across antennas yields a very significant
performance advantage with the linear MMSE receiver at fixed and not too large rate (notice that fixed rate
R corresponds to the case of zero multiplexing gain, r = 0). In order to illustrate these claims, we provide
simulation results for the following outage probabilities under i.i.d. Rayleigh fading:

• MIMO outage probability (3) with input covariance $(\rho/M) I$ (scheme (a) in Fig. 1);

• outage probability (5) with ZF and MMSE receivers with coding across antennas (scheme (b) in
Fig. 1);

• outage probability (6) with ZF and MMSE receivers under pure spatial multiplexing, i.e., without
coding across antennas (scheme (c) in Fig. 1).

Fig. 2 shows the corresponding plots at rates R = 1 and 5 bits per channel use (bpcu).

^5 This also shows that the assumption that the $\gamma_k$'s are i.i.d., made in [20], is incompatible with the final result of that paper
on the diversity of the ZF receiver.
Figure 2: Outage probabilities of ZF and MMSE receivers, 2x2 i.i.d. Rayleigh channel, R = 1 and 5 bpcu.



Several interesting observations can be drawn from this figure. We observe that while at high rates the
MMSE receiver with coding across antennas behaves as predicted by the DMT analysis, its behavior at low rates
is in stark contrast to the asymptotic result (this fact was also noticed in [20]). In fact, the MMSE receiver exhibits
an apparent "full diversity" behavior at small rate (e.g., R = 1 bpcu in Fig. 2). In contrast, the behavior of
the ZF receiver is accurately predicted by the asymptotic analysis at all rates. This remarkable behavior of
the MMSE receiver is explained through an approximate analysis in Section 4.

From Fig. 2 we also observe that coding across antennas does achieve an advantage over spatial multiplexing.
For the MMSE receiver operating at small rates the advantage is very significant, and corresponds
to the diversity advantage discussed above. At high rates the advantage is moderate and consists only of a
horizontal shift (dB gain) of the error curve, not of a steeper slope.



4 MMSE receiver with coding across antennas 

The difference between the performances of the ZF and MMSE receivers is best explained by comparing
their corresponding upper bounds on outage probability in (20) and (13). While only the minimum eigenvalue
appears in the ZF case in (20), all eigenvalues play a role in the case of the MMSE receiver in (13).
Although at asymptotically high SNR and high coding rates the minimum eigenvalue dominates (and therefore
determines the corresponding DMT), the other eigenvalues appear to be relevant at lower rates and
provide higher effective diversity for the MMSE receiver. In order to substantiate this intuition, we compare
in Fig. 3 the outage probability of the MMSE receiver with coding across antennas for the case M = N = 4
with the corresponding upper bound in (13). The upper bound is found to be very accurate across a wide
range of rates and SNRs. The particular choice of rates for this plot will be made clear in the sequel, where
we analyze the high SNR behavior of the outage probability upper bound (13).



Define $\Gamma_k = \frac{1}{1 + \frac{\rho}{M}\lambda_k}$ and $\Gamma = M 2^{-R/M}$. We use the change of variables $\lambda_k = \rho^{-\alpha_k}$, where $\alpha_k$ denotes the
level of singularity of the corresponding eigenvalue [3]. For ease of analysis we make the assumption that
the channel eigenvalues fall into one of the following two categories:

• $\alpha_k < 1$, i.e., $\lambda_k$ is "much larger" than the inverse SNR $1/\rho$: in this case, $\Gamma_k \to 0$ as $\rho \to \infty$.

• $\alpha_k > 1$, i.e., $\lambda_k$ is "much smaller" than $1/\rho$: in this case, $\Gamma_k \to 1$ as $\rho \to \infty$.

Recall that the $\{\alpha_i\}$ are ordered according to $\alpha_1 \ge \cdots \ge \alpha_M$. Suppose that the rate R is such that
$m - 1 < \Gamma < m$, for some integer $m = 1, 2, \ldots, M$, i.e.,

$$ M \log \frac{M}{m} < R < M \log \frac{M}{m - 1}. \qquad (21) $$

For all $i = 1, \ldots, M$ define the event

$$ \mathcal{E}_i = \{\alpha_1, \ldots, \alpha_i > 1\} \cap \{\alpha_{i+1}, \ldots, \alpha_M < 1\}. \qquad (22) $$

Then, for large $\rho$, the following approximation holds

$$ \left\{ \sum_{k=1}^{M} \Gamma_k \ge \Gamma \right\} \approx \bigcup_{i=m}^{M} \{\alpha_1, \ldots, \alpha_i > 1\} \cap \{\alpha_{i+1}, \ldots, \alpha_M < 1\} = \mathcal{E}_m \cup \mathcal{E}_{m+1} \cup \cdots \cup \mathcal{E}_M. \qquad (23) $$

In the above approximation we are neglecting the cases where the eigenvalues take on values that are
comparable with $1/\rho$, and therefore contribute to the sum $\sum_k \Gamma_k$ in (13) by a quantity between 0 and
1. It can be expected that as $\rho \to \infty$, the probability of such intermediate values decreases, and our
approximation becomes tight.



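Using the union bound, we find an approximate upper bound on (13) given by

$$ P\left( \sum_{k=1}^{M} \Gamma_k \ge \Gamma \right) \lesssim \sum_{i=m}^{M} P(\mathcal{E}_i). \qquad (24) $$

Defining $P(\mathcal{E}_i) \doteq \rho^{-d_i(R)}$, $i = 1, \ldots, M$, using the joint pdf of the $\alpha_k$'s, given by [3]

$$ p(\alpha) = K_{M,N} (\log \rho)^M \prod_{i=1}^{M} \rho^{-(N - M + 1)\alpha_i} \prod_{i<j} \left( \rho^{-\alpha_i} - \rho^{-\alpha_j} \right)^2 \exp\left( -\sum_{i=1}^{M} \rho^{-\alpha_i} \right) $$
$$ \doteq \left[ \prod_{i=1}^{M} \rho^{-(2i - 1 + N - M)\alpha_i} \right] \exp\left( -\sum_{i=1}^{M} \rho^{-\alpha_i} \right), $$

and applying Varadhan's lemma as in [3], we obtain

$$ d_i(R) = \inf_{\substack{\alpha_j > 1\ \forall j \le i \\ \alpha_j < 1\ \forall j > i \\ \alpha_j \ge 0\ \forall j}} \sum_{j=1}^{M} (2j - 1 + N - M)\, \alpha_j $$
$$ = \sum_{j=1}^{i} (2j - 1 + N - M) \times 1 + \sum_{j=i+1}^{M} (2j - 1 + N - M) \times 0 $$
$$ = i(i + N - M). \qquad (25) $$

From (24) and (25), we eventually conclude that the sum in (24) is dominated at high SNR by the term with the
smallest exponent, i.e., the term with i = m. This yields the diversity of the MMSE receiver with spatial encoding
at a finite rate R as

$$ d_{\rm mmse}(R) \approx m(m + N - M). \qquad (26) $$

In particular, when M = N, $d_{\rm mmse}(R) \approx m^2$, where m and R are related by (21).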

To illustrate the effectiveness of the above approximation, consider the plots in Fig. 3 for the case
M = N = 4. The coding rates are R = 0.7706, 2.7123, 5.6601 and 12 bpcu, corresponding to $\Gamma$ =
3.5, 2.5, 1.5 and 0.5 respectively. The diversities 16, 9, 4 and 1 predicted by the analysis in (26) approximate
well the measured slopes (at high SNR) of the outage curves, that are 15.15, 10.69, 5.55 and 1.3
in the $\log P_{\rm out}^{\rm mmse}(R, \rho)$ vs. $\log \rho$ chart observed in Fig. 3.
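
The mapping from a fixed rate R to the predicted diversity in (26) is easy to reproduce. The sketch below uses only the Python standard library and assumes rates in bits per channel use (base-2 logarithms in (21)); the function name `mmse_finite_rate_diversity` is an illustrative choice, and rates that make the threshold an exact integer fall on a boundary that the approximate analysis does not cover.

```python
import math

def mmse_finite_rate_diversity(R, M, N):
    """Return (Gamma, m, d) with Gamma = M * 2^(-R/M), m - 1 < Gamma < m, d = m (m + N - M)."""
    threshold = M * 2.0 ** (-R / M)
    m = math.ceil(threshold)     # smallest integer above the threshold
    return threshold, m, m * (m + N - M)

# Rates used for the M = N = 4 example in the text.
rates = [0.7706, 2.7123, 5.6601, 12.0]
results = [mmse_finite_rate_diversity(R, 4, 4) for R in rates]
diversities = [d for _, _, d in results]
```

For these four rates the thresholds come out close to 3.5, 2.5, 1.5 and 0.5, which gives m = 4, 3, 2, 1 and the diversities quoted above.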
Figure 3: Diversity of the MMSE receiver with joint spatial encoding: solid lines represent the outage
probability in (5) and the dash-dot lines represent the corresponding upper bounds (13). M = N = 4, rates
R = 0.7706, 2.7123, 5.6601 and 12 bpcu.
5 Outage probability of linear receivers in the large antenna regime

In order to motivate this section, consider the following system design issue: for a given target spectral
efficiency, block-error rate, operating SNR, and receiver computational complexity (including power
consumption, VLSI chip area etc.) how many antennas do we need at the transmitter and receiver? Consider
the outage probability curves of Fig. 4 and suppose that we wish to achieve a rate of R = 3 bpcu with
block-error rate of 10~^ at SNR not larger than 15 dB. With M = N = 2 antennas this target performance 
is achieved by an optimal receiver, but is not achieved by the MMSE receiver. However, with M = 2, N = 4 
or M = N = 3 the target performance is achieved also by the MMSE receiver. It turns out that, in some 
cases, adding antennas may be more convenient than insisting on high-complexity receiver processing. 
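
The design question above can be explored numerically. The following is a minimal Monte Carlo sketch, not the procedure used to produce Fig. 4. It assumes i.i.d. CN(0,1) channel coefficients, total transmit SNR $\rho$ split equally across the $M$ transmit antennas, the standard MMSE relation $1 + \gamma_k = 1/[(I + \frac{\rho}{M} H^H H)^{-1}]_{kk}$, and rates measured in bits. The function names and the number of trials are illustrative choices; resolving very small block-error rates would need far more trials.

```python
import numpy as np

def mutual_information_samples(M, N, snr_db, trials, rng):
    """Draw `trials` channels and return (MMSE MI, optimal MI) in bits per channel use."""
    rho = 10.0 ** (snr_db / 10.0)
    mmse_mi = np.empty(trials)
    opt_mi = np.empty(trials)
    for t in range(trials):
        # N x M channel with i.i.d. CN(0, 1) entries (assumed normalization)
        H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2.0)
        G = np.eye(M) + (rho / M) * (H.conj().T @ H)
        # 1 + gamma_k = 1 / [G^{-1}]_kk for the linear MMSE receiver
        inv_diag = np.real(np.diag(np.linalg.inv(G)))
        mmse_mi[t] = -np.sum(np.log2(inv_diag))
        # log det(I + rho/M H^H H) for the optimal receiver
        opt_mi[t] = np.linalg.slogdet(G)[1] / np.log(2.0)
    return mmse_mi, opt_mi

def outage_probability(mi_samples, rate):
    """Empirical probability that the mutual information falls below `rate`."""
    return float(np.mean(mi_samples < rate))

rng = np.random.default_rng(0)
results = {}
for M, N in [(2, 2), (2, 4), (3, 3)]:
    mmse_mi, opt_mi = mutual_information_samples(M, N, snr_db=15.0, trials=5000, rng=rng)
    results[(M, N)] = (outage_probability(mmse_mi, 3.0), outage_probability(opt_mi, 3.0))
    print(f"M={M}, N={N}: MMSE outage={results[(M, N)][0]:.4f}, optimal outage={results[(M, N)][1]:.4f}")
```

Because the MMSE mutual information never exceeds the log-det mutual information for the same channel draw, the empirical MMSE outage in this sketch is never below the optimal-receiver outage.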




[Figure 4 plot; horizontal axis: SNR (dB), from -10 to 35.]



Figure 4: Comparing the outage probability of optimal and MMSE receivers, R = 3 bpcu. 



It is therefore interesting to analyze the outage probability of a linear receiver with coding across the
antennas in the regime of fixed SNR $\rho$ and rate $R$. This analysis is difficult because, for
finite $M, N$, the joint distribution of the channel SINRs $\{\gamma_k\}$ does not admit a closed-form expression. This
problem can be overcome by considering the system in the limit of a large number of antennas. Specifically,
we will show that the mutual information for the linear MMSE and ZF receivers becomes asymptotically
Gaussian. Therefore, the outage probability for large but finite dimensions and fixed SNR can be accurately
approximated by a Gaussian cdf with appropriate mean and variance, which we shall give in closed form.

In the next subsection we will discuss the methodology used to show the asymptotic Gaussianity of the
mutual information. The method is general and applies to both MMSE and ZF linear receivers.
Subsequently, in Section 5.2 we will calculate the first and second cumulant moments of the SINR for the MMSE
and ZF receivers, which suffice to characterize the limiting distribution of the mutual information.
5.1 Asymptotic Gaussianity of the mutual information 

The mutual information at the output of a linear receiver with $M$ transmit and $N \ge M$ receive antennas
and coding across the antennas is given by

\[
I_N = \sum_{k=1}^{M} \log(1 + \gamma_k) \tag{27}
\]

with $\gamma_k$ given by (9) for the MMSE case and by (7) for the ZF case. In the following, we fix the ratio
$\beta = M/N \le 1$ and consider the limit for large $N$ and the "fluctuations" around this limit. In order to prove
the asymptotic Gaussianity of these fluctuations, we will need to analyze the characteristic function of the
mutual information, given by

\[
\Phi_N(\omega) = \mathbb{E}\left[e^{j\omega I_N}\right]. \tag{28}
\]

We start by considering the cumulant generating function [32], defined as

\[
\phi_N(\omega) = \log \Phi_N(\omega) = \sum_{n=1}^{\infty} \frac{(j\omega)^n}{n!}\, C_n, \tag{29}
\]

where the coefficient $C_n$ is the $n$-th cumulant moment of the mutual information. In general, the joint
cumulant of $m$ random variables $X_1, \ldots, X_m$ is defined as

\[
\mathbb{E}_c(X_1; \ldots; X_m) \triangleq \sum_{\pi} (|\pi| - 1)!\,(-1)^{|\pi|-1} \prod_{B \in \pi} \mathbb{E}\left[\prod_{i \in B} X_i\right],
\]

where $\pi$ runs through all partitions of $\{1, \ldots, m\}$, $|\pi|$ denotes the number of blocks in $\pi$ and $B$ runs through
the list of all blocks of $\pi$. We will call the above moment irreducible, with respect to the random variables
$X_1, \ldots, X_m$, when in each argument of the cumulant moment only one random variable $X_i$ appears. By
contrast, a reducible cumulant moment with respect to the same random variables has arguments containing
mixed products of these random variables. In general, an $n$-order reducible cumulant moment can be written
in terms of a sum of products of irreducible cumulant moments, with each term in the sum having moments
with order summing up to $n$.

The $n$-th cumulant moment $C_n$ of a random variable $X$ is defined to be

\[
C_n = \mathbb{E}_c(\underbrace{X; \ldots; X}_{n \text{ times}}).
\]

For example, the first few cumulant moments of $X$ are

\[
C_1 = \mathbb{E}[X] \ \text{(mean)}, \qquad C_2 = \mathrm{Var}[X] \ \text{(variance)}, \qquad C_3 = \mathrm{Sk}[X] \ \text{(skewness)}. \tag{30}
\]
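
As an illustration of the partition formula for the joint cumulant, the sketch below evaluates it on empirical samples by enumerating all set partitions. It is only meant for small orders $m$, since the number of partitions grows quickly; the function names `set_partitions` and `joint_cumulant` are illustrative and not part of the original derivation.

```python
import math
import numpy as np

def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks (lists)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        # place `first` into each existing block in turn ...
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        # ... or into a new block of its own
        yield [[first]] + part

def joint_cumulant(samples):
    """Empirical joint cumulant E_c(X_1; ...; X_m).

    `samples` has shape (m, T): row i holds T draws of X_i.
    """
    samples = np.asarray(samples, dtype=float)
    m = samples.shape[0]
    total = 0.0
    for part in set_partitions(list(range(m))):
        k = len(part)  # number of blocks |pi|
        prod = 1.0
        for block in part:
            # empirical E[prod_{i in B} X_i]
            prod *= np.mean(np.prod(samples[block, :], axis=0))
        total += math.factorial(k - 1) * (-1) ** (k - 1) * prod
    return total

rng = np.random.default_rng(0)
x = rng.exponential(size=100_000)
print("C1:", joint_cumulant([x]))
print("C2:", joint_cumulant([x, x]))
print("C3:", joint_cumulant([x, x, x]))
```

For a unit-mean exponential variable the exact cumulants are $C_1 = 1$, $C_2 = 1$ and $C_3 = 2$, so the printed estimates should land close to these values.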



The probability density of $I_N$ can be expressed in terms of (28) and (29) as follows

\[
p_N(y) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \exp\left( -j\omega (y - C_1) - \frac{\omega^2}{2} C_2 + \sum_{n \ge 3} \frac{(j\omega)^n}{n!} C_n \right) d\omega. \tag{31}
\]

In Section 5.2, we will show that in the limit of large $N$ and $M = \beta N$ with $\beta < 1$,

\[
C_1 = m_1 + o(1) \tag{32}
\]
\[
C_2 = \sigma^2 + o(1) \tag{33}
\]

where $m_1 = M c_{10} + c_{11}$, and where $c_{10}$, $c_{11}$ and $\sigma^2$ are constants independent of $N$ for which we give
closed-form expressions for both MMSE and ZF cases. In Appendix D we will also show that all higher-order
cumulants of the mutual information asymptotically vanish for large $N$. Therefore, $\phi_N(\omega)$ is a quadratic
function of $\omega$ with corrections that vanish as $N \to \infty$. As a result the mutual information is asymptotically
Gaussian, i.e.

\[
\frac{I_N - m_1}{\sigma} \xrightarrow{d} \mathcal{N}(0, 1). \tag{34}
\]

This follows directly from (31) by setting $y = z + m_1$ and taking the large $N$ limit

\[
p(z) = \lim_{N \to \infty} \frac{1}{2\pi} \int_{-\infty}^{\infty} \exp\left( -j\omega z - \frac{\omega^2}{2}\sigma^2 + o(1) \right) d\omega
= \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{z^2}{2\sigma^2}}. \tag{35}
\]
Before moving on to the proofs, we would like to comment on the nature of this result. It states that the
probability $P(|I_N - m_1| > z)$ approaches a Gaussian probability for sufficiently large $N$ and fixed distance
$z$ of the mutual information from its mean. This is quite different from stating that for fixed $N$ the mutual
information distribution falls off like a Gaussian random variable for any $z$ and $\rho$. As a matter of fact,
for fixed $N$ and large enough SNR this Gaussian approximation is no longer valid, since the higher-order
cumulants will no longer be small.

It is also worth pointing out here that the variance of $I_N$ is $O(1)$ (a finite constant) for large $N$. This
is another manifestation of the fact that the SINRs of the parallel channels $\{\gamma_k\}$ are strongly correlated, in
agreement with the outage analysis of previous sections. In contrast, if they were independent, or nearly
independent, the variance would be roughly linear in $N$, as the central limit theorem would suggest. This
fact is in line with the well-known behavior of the mutual information $\log\det(I + \frac{\rho}{M} H H^H)$ under the optimal
receiver [23], [24], [25], [26], where again the variance is $O(1)$ for large $N$, indicating the strong correlation among
the eigenvalues of $H^H H$.
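
The $O(1)$ scaling of the variance can be checked empirically. The sketch below is an illustrative experiment under the normalization used in this section (entries of $H$ drawn from $\mathcal{CN}(0, 1/N)$ and $\alpha = \rho N/M$), with mutual information in nats. The sizes, SNR and trial counts are arbitrary choices, and `mmse_mi_samples` is a hypothetical helper name.

```python
import numpy as np

def mmse_mi_samples(M, N, rho, trials, rng):
    """Samples of I_N = sum_k log(1 + gamma_k) (nats) for the linear MMSE receiver."""
    alpha = rho * N / M
    out = np.empty(trials)
    for t in range(trials):
        # i.i.d. CN(0, 1/N) entries, as assumed in Section 5
        H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2.0 * N)
        G = np.eye(M) + alpha * (H.conj().T @ H)
        # 1 + gamma_k = 1 / [(I + alpha H^H H)^{-1}]_kk
        out[t] = -np.sum(np.log(np.real(np.diag(np.linalg.inv(G)))))
    return out

rng = np.random.default_rng(0)
rho = 10.0  # 10 dB
stats = {}
for N in (8, 32):
    M = N // 2  # beta = 0.5
    samples = mmse_mi_samples(M, N, rho, trials=3000, rng=rng)
    stats[N] = (samples.mean(), samples.var())
    print(f"N={N}, M={M}: mean={stats[N][0]:.3f}, variance={stats[N][1]:.3f}")
```

If the SINRs were nearly independent, quadrupling $N$ would roughly quadruple the variance; in this experiment the mean grows with $M$ while the variance should stay of the same order.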

5.2 Joint cumulant moments of the SINRs of order 1 and 2 

Our goal is to calculate the cumulant moments of $I_N$. Since $I_N$ consists of a sum of mutual informations of
the virtual channels (see (27)), the $n$-th cumulant moment of $I_N$ can be written as

\[
C_n = \sum_{k_1, \ldots, k_n = 1}^{M} \mathbb{E}_c\left[\log(1 + \gamma_{k_1}); \ldots; \log(1 + \gamma_{k_n})\right]. \tag{36}
\]

The building blocks of the above cumulant moments are the joint cumulant moments of the SINRs $\{\gamma_k\}$,
i.e.

\[
\mathbb{E}_c\left[\gamma_{k_1}; \gamma_{k_2}; \ldots; \gamma_{k_n}\right]. \tag{37}
\]

In fact, by expanding the logarithms in (36) in Taylor series, we can express (36) in terms of (37). Even
calculating these joint cumulants generally amounts to a formidable task. However, with the help of
Theorem 2 (Novikov's theorem) given in Appendix B and due to simplifications that occur in the large $N$ limit,
we will show that this computation is possible. To obtain a feel for the computation, we will first calculate
the first two joint cumulants of $\{\gamma_k\}$ and defer the proof that the higher-order cumulants vanish sufficiently
fast with $N$ to Appendix D.
5.2.1 Cumulant moments for the MMSE receiver

Starting with the case of the MMSE receiver, we recall that the SINR of the $k$-th virtual channel
induced by the MMSE receiver can be written as

\[
\gamma_k = \alpha\, h_k^H \left[ I + \alpha H_k H_k^H \right]^{-1} h_k, \tag{38}
\]

where $H_k$ is the $N \times (M - 1)$ matrix obtained by eliminating the $k$-th column from the channel matrix $H$,
and contains i.i.d. Gaussian elements $\sim \mathcal{CN}(0, 1/N)$, and we have defined for convenience $\alpha = \rho N/M = \rho/\beta$.

The asymptotic mean of $\gamma_k$ in the limit of large $N$ and $M = \beta N$ has been calculated in [HI] in the
context of large-system analysis of CDMA with random spreading, and successively rederived in various
ways (e.g., [33], [27], [29]). Due to symmetry, the result does not depend on the index $k$. Hence, without loss
of generality we can choose $k = 1$. We have



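\[
\mathbb{E}[\gamma_1] = \alpha\, \mathbb{E}\left[ \frac{1}{N} \mathrm{Tr}\left\{ \left[ I + \alpha H_1 H_1^H \right]^{-1} \right\} \right]. \tag{39}
\]

The leading order in $N$ of the above trace can be evaluated as

\[
g_1^{\rm mmse}(\alpha, \beta) = \lim_{N \to \infty} \alpha\, \mathbb{E}\left[ \frac{1}{N} \mathrm{Tr}\left\{ \left[ I + \alpha H_1 H_1^H \right]^{-1} \right\} \right]
= \frac{\alpha}{1 + \dfrac{\alpha\beta}{1 + g_1^{\rm mmse}(\alpha, \beta)}}. \tag{40}
\]

Solving for $g_1^{\rm mmse}(\alpha, \beta)$ in (40), we obtain

\[
g_1^{\rm mmse}(\alpha, \beta) = \frac{\alpha(1 - \beta) - 1 + \sqrt{(\alpha(1 - \beta) - 1)^2 + 4\alpha}}{2}. \tag{41}
\]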
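
The closed form (41) can be sanity-checked numerically. The sketch below verifies that it satisfies the fixed-point relation (40) and compares it with a Monte Carlo estimate of the normalized trace in (39), using the finite-size aspect ratio $(M-1)/N$ of $H_1$. The helper names and the chosen sizes are illustrative assumptions.

```python
import numpy as np

def g1_mmse(alpha, beta):
    """Closed-form asymptotic mean SINR of the MMSE receiver, eq. (41)."""
    a = alpha * (1.0 - beta) - 1.0
    return 0.5 * (a + np.sqrt(a * a + 4.0 * alpha))

def empirical_g1(alpha, M, N, trials, rng):
    """Monte Carlo estimate of alpha * E[(1/N) Tr (I + alpha H1 H1^H)^{-1}]."""
    acc = 0.0
    for _ in range(trials):
        # H1: N x (M-1) matrix with i.i.d. CN(0, 1/N) entries
        H1 = (rng.standard_normal((N, M - 1)) + 1j * rng.standard_normal((N, M - 1))) / np.sqrt(2.0 * N)
        B1 = np.linalg.inv(np.eye(N) + alpha * (H1 @ H1.conj().T))
        acc += alpha * np.real(np.trace(B1)) / N
    return acc / trials

alpha, M, N = 20.0, 32, 64
g = g1_mmse(alpha, M / N)
# residual of the fixed-point equation (40); should be numerically zero
residual = g - alpha / (1.0 + alpha * (M / N) / (1.0 + g))
rng = np.random.default_rng(0)
g_mc = empirical_g1(alpha, M, N, trials=200, rng=rng)
g_finite = g1_mmse(alpha, (M - 1) / N)
print("fixed-point residual:", residual)
print("Monte Carlo:", g_mc, " closed form at (M-1)/N:", g_finite)
```

Evaluating (41) at the aspect ratio $(M-1)/N$ rather than $M/N$ reflects the fact that $H_1$ has only $M - 1$ columns.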



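To be able to calculate the $O(1)$ correction to the mean mutual information, we need to evaluate the
next-to-leading ($O(1/N)$) correction to $\mathbb{E}[\gamma_1]$. The correction follows by noticing that the term $\beta$ in (40) should be
replaced by the aspect ratio of the matrix $H_1$. For large but finite $N$, this is equal to $(M - 1)/N = \beta - 1/N$.
Therefore, the correction can be evaluated by replacing $\beta$ by $\beta - 1/N$ in (41). Using the Taylor series
expansion, this amounts to computing

\[
\mathbb{E}[\gamma_1] = g_1^{\rm mmse}\left(\alpha, \beta - \tfrac{1}{N}\right) + O(N^{-2}) \tag{42}
\]
\[
\phantom{\mathbb{E}[\gamma_1]} = g_1^{\rm mmse}(\alpha, \beta) - \frac{1}{N} \frac{\partial}{\partial \beta} g_1^{\rm mmse}(\alpha, \beta) + O(N^{-2}), \tag{43}
\]

where

\[
\frac{\partial}{\partial \beta} g_1^{\rm mmse}(\alpha, \beta) = -\frac{\alpha}{2} \left[ 1 + \frac{\alpha(1 - \beta) - 1}{\sqrt{(\alpha(1 - \beta) - 1)^2 + 4\alpha}} \right]. \tag{44}
\]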
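For later use, we define also the following asymptotic moments

\[
g_m^{\rm mmse}(\alpha, \beta) = \lim_{N \to \infty} \alpha^m\, \mathbb{E}\left[ \frac{1}{N} \mathrm{Tr}\left\{ \left[ I + \alpha H_1 H_1^H \right]^{-m} \right\} \right], \tag{45}
\]

which can be obtained by repeatedly differentiating $g_1^{\rm mmse}(\alpha, \beta)$ with respect to $\alpha$ using the recursive relation

\[
g_{m+1}^{\rm mmse}(\alpha, \beta) = \frac{\alpha^2}{m} \frac{\partial}{\partial \alpha}\, g_m^{\rm mmse}(\alpha, \beta), \qquad m \ge 1. \tag{46}
\]

Thus we have

\[
g_2^{\rm mmse}(\alpha, \beta) = \alpha^2 \frac{\partial}{\partial \alpha}\, g_1^{\rm mmse}(\alpha, \beta), \tag{47}
\]
\[
g_3^{\rm mmse}(\alpha, \beta) = \frac{\alpha^2}{2} \frac{\partial}{\partial \alpha}\, g_2^{\rm mmse}(\alpha, \beta). \tag{48}
\]

For large SNR, i.e. $\alpha = \rho/\beta \gg 1$ and $\beta < 1$, $g_1^{\rm mmse}(\alpha, \beta)$ is approximately $\alpha(1 - \beta)$. This result indicates
that only the $\sim N(1 - \beta)$ zero eigenvalues of the matrix $H_1 H_1^H$ contribute to the SINR for large $\alpha$. Similarly,
$g_m^{\rm mmse}(\alpha, \beta) \approx (1 - \beta)\alpha^m$ for large $\rho$ and $\beta < 1$, while for $\beta = 1$, $g_m^{\rm mmse}(\alpha, \beta) \approx k_m \rho^{m - 1/2}$, where the constant
$k_m$ satisfies $k_{m+1} = \prod_{j=1}^{m} \left(1 - \frac{1}{2j}\right)$.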

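Next we calculate the matrix of the joint cumulants of order 2 with elements

\[
\Sigma_{ij}^{\rm mmse} = \mathbb{E}_c[\gamma_i; \gamma_j].
\]

Given the symmetry, all diagonal elements ($i = j$) are equal, and so are all off-diagonal ones ($i \ne j$).
Therefore, it is sufficient to compute $\Sigma_{11}^{\rm mmse}$ and $\Sigma_{12}^{\rm mmse}$.

We start with $\mathbb{E}_c[\gamma_1; \gamma_1]$. For convenience, we define $B_1 = \left(I + \alpha H_1 H_1^H\right)^{-1}$, and let $(B_1)_{ij}$ denote the
$(i, j)$ element of $B_1$ and $h_{1i}$ denote the $i$-th element of $h_1$. Then, a direct application of (38) and (85) yields

\[
\Sigma_{11}^{\rm mmse} = \mathbb{E}_c[\gamma_1; \gamma_1]
= \alpha^2 \sum_{a,b,c,d} \mathbb{E}_c\left[ h_{1a}^* (B_1)_{ab} h_{1b};\; h_{1c}^* (B_1)_{cd} h_{1d} \right]
= \frac{\alpha^2}{N^2}\, \mathbb{E}\left[ \mathrm{Tr}(B_1^2) \right] + O(1/N^2)
= \frac{v_d^{\rm mmse}}{M} + O(1/N^2), \tag{49}
\]

where $v_d^{\rm mmse} = \beta\, g_2^{\rm mmse}\left(\alpha, \beta - \tfrac{1}{N}\right)$. The leading correction in the autocorrelation is non-vanishing
only due to the random character of the vector $h_1$ [27].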
We now turn to the more complicated computation of $\Sigma_{12}^{\rm mmse}$ to leading order in $N$. To simplify notation,
we define the matrices $B_i = \left(I + \alpha H_i H_i^H\right)^{-1}$, for $i = 1, 2$, as before, and $B_{12} = \left(I + \alpha H_{12} H_{12}^H\right)^{-1}$, where
$H_{12}$ is obtained by striking out from $H$ both columns $h_1$ and $h_2$. Therefore,

\[
\gamma_1 = \alpha\, h_1^H B_1 h_1, \qquad \gamma_2 = \alpha\, h_2^H B_2 h_2. \tag{50}
\]

Using the same notation as before, we rewrite the cumulant moment of $\gamma_1, \gamma_2$ as

\[
\mathbb{E}_c[\gamma_1; \gamma_2] = \alpha^2 \sum_{a,b,c,d} \mathbb{E}_c\left[ h_{1a}^* (B_1)_{ab} h_{1b};\; h_{2c}^* (B_2)_{cd} h_{2d} \right]. \tag{51}
\]

In the following we will make extensive use of the following matrix identities, obtained by applying the
Sherman-Morrison matrix inversion lemma:

\[
B_2 = B_{12} - \frac{\alpha\, B_{12} h_1 h_1^H B_{12}}{1 + \alpha\, h_1^H B_{12} h_1}, \qquad
B_1 = B_{12} - \frac{\alpha\, B_{12} h_2 h_2^H B_{12}}{1 + \alpha\, h_2^H B_{12} h_2}. \tag{52}
\]


We will now use Novikov's theorem (Theorem 2 in Appendix B) to successively average over the variables
$h_1$ and $h_2$. For example, considering the general term for indices $(a, b, c, d)$ in (51) we write

E,[/iL(Bi),,/ii,;r2,(B2),,M = 
= E [hi, (Bi)„, h^,h*,, (B2),, M - E [hi, {B^),, h^,] E [h*,, (B2),, /i2d] 



1 

iV 
1 

iV 



E 

E 
1 



d 



dhia 
dhib 



-E 



Ohia 

dhia 



((Bl)ab(B2)cd^l6^2c^2d) 
l!Bi)ab(B2)cd/l2c^2d 



-E 



N 



d 



dhia 



{{Bl)abhlb) 



E [hUB2)cdh2d] 



2)cd 



dhia 



(Bl)abhlbh2ch2d 



(Bi) 



ab 



E [hUB2)cdh2d] 



(53) 



(54) 



where in (53) we have applied Novikov's theorem, formally replacing $h_{1a}^*$ with $\frac{1}{N}\frac{\partial}{\partial h_{1a}}$ inside the expectations.
We remind the reader that, as explained in Appendix B, in the above manipulations we treat the complex
variables $h_{ka}$ and $h_{nb}^*$ as distinct and independent for all $k, n, a, b$, such that partial derivatives are performed
individually with respect to these variables. In order to compute the derivative of $B_2$ we use the matrix inversion lemma
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(52) for B2. After some algebra, we obtain 

lEc[7i;72] ■■ 



a 



N 
3 



a 



N 
4 



rE 



a 



-E 



TV(Bi);h^B2h2 

hHBi2BihihHBi2h2 

1 + ah'^Bi2hi 
hHBi2BihihHBi2hihHBi2h2 
(l + ahHBi2hi)' 



(55) 
(56) 

(57) 



The term in (55) results from summing over all indices the first two terms in (54), and the terms in (56)
and (57) result from summing the last term in (54) after applying the partial derivative with respect to the
elements of $h_1$ appearing in the numerator and the denominator of the matrix inversion lemma expansion of
$B_2$. It is important to notice that the order of magnitude of the first term is $O(1)$, while the last two terms
are $O(1/N)$. The reason is that the last two terms are the result of applying the partial derivative in $h_{1a}$ to
$B_2$, where the term that depends on $h_1$ is scaled by a factor $O(1/N)$ compared to the remaining matrix.

We proceed now by applying Novikov's theorem to the random variables appearing in the numerator
of (56) and (57) and exchanging the corresponding expectation with a derivative $\partial/\partial h_{2a}$. However, with
some hindsight we only apply the derivative to $h_2$ and not to $B_1$, which would give a subleading term in $N$.
Therefore, to leading order in $N$ we have



(56) 



a 



E 



(57) 



a 

iV2 



E 



h^Bf^Bihi 
1 + ahHBi2hi 

hHBi2BihihHBf2hi 

(l + ahHBi2hi)2 



a 



Tr(Bf, 

N 



1 



_mmse 
53 



iV2 1 + ^Tr (B12) N'^l + 



- (a, (3 



N ) 



a 



Tr(B 



N 



92 



n) 



iV2 H 



+ ^1Y(Bi2 



)y 



iV2 



(58) 



(59) 



where the approximation sign $\approx$ means to leading order in $1/N$. The second expression in each line was obtained
by averaging over $h_1$ to leading order, i.e. only in the numerator. In the last equation in each line we used
the fact that $\frac{\alpha}{N}\mathrm{Tr}(B_{12}) \approx g_1^{\rm mmse}\left(\alpha, \beta - \tfrac{2}{N}\right)$.

Next we may go back to (55) and expand $B_2$ using (52). After applying exactly the same methods as
above we arrive at the following expression



(55) 



52— 



N) 



1 



53— K/? 



2 

— 2Ec[Tr(Bi2) ;Tr(Bi2)] + — 2 , , ^..2 at2 , , mmse ^ p 



) 



_2_' 



(60) 
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We collect all terms and use (90) to reach the final result, 



1 



2 



(61) 



1 / ^g~{a,l5-^f 2gr'^{a,(3-^) 



+ 



M2 



+ 0(l/iV^ 



where we let 



_ .2^ 35r=^(a,/3-f)' 2ff3™(a,/3-|) 

^od — P 



4 



+ ^ ^ I • (62) 

l + 2a(l + /?-^)+a2(i_/3+^)2 



For large a, v'^J"^^ ^ when /? < 1, and vj^"^^^ ~ P^/IS when /3 = 1. 

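We collect the results of (61) and (49) by writing the correlation matrix for the SINRs $\{\gamma_k\}$ to leading
order as:

\[
\Sigma^{\rm mmse} = \left( \frac{v_d^{\rm mmse}}{M} - \frac{v_{od}^{\rm mmse}}{M^2} \right) I + \frac{v_{od}^{\rm mmse}}{M^2}\, \mathbf{1}\mathbf{1}^T. \tag{63}
\]

It is worth pointing out that despite the fact that the off-diagonal elements are much smaller compared to
the diagonal ones, they all contribute to the eigenvalues of $\Sigma^{\rm mmse}$. In fact, these can be computed in closed form
and are given by

\[
\lambda_1(\Sigma^{\rm mmse}) = \frac{v_d^{\rm mmse}}{M} + (M - 1)\, \frac{v_{od}^{\rm mmse}}{M^2}
\]

and

\[
\lambda_k(\Sigma^{\rm mmse}) = \frac{v_d^{\rm mmse}}{M} - \frac{v_{od}^{\rm mmse}}{M^2}
\]

for all $k = 2, \ldots, M$.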



5.2.2 Cumulant moments for the ZF receiver 

The corresponding results for the ZF receiver can be derived directly from the previous section by observing
that the SINR for the $k$-th channel of the ZF receiver, given by (7), can be deduced from the corresponding
expression $\gamma_k$ for the MMSE receiver in the limit of infinite $\alpha$, i.e.

\[
\gamma_k^{\rm zf} = \frac{\alpha}{\left[ (H^H H)^{-1} \right]_{kk}}
= \alpha \lim_{\alpha_0 \to \infty} \frac{\gamma_k^{\rm mmse}(\alpha_0)}{\alpha_0}
= \alpha \lim_{\alpha_0 \to \infty} \frac{1}{\alpha_0} \left( \frac{1}{\left[ (I + \alpha_0 H^H H)^{-1} \right]_{kk}} - 1 \right). \tag{64}
\]

A subtle point needs to be stressed here: the results for the ZF receiver cannot be obtained simply as the
"limit for high SNR" of the results for the MMSE receiver. Rather, we have to distinguish between the
channel SNR (contained in the parameter $\alpha$) and the SNR parameter in the linear receiver matrix expression
(indicated by $\alpha_0$ above) that we let go to infinity in order to obtain the ZF results. It can be shown that for
$\beta < 1$ the analysis of the previous section involving the matrices $B_1$, $B_2$ and $B_{12}$ can be carried out in
this limiting case. In addition, as seen in Appendix C, the condition for the validity of the manipulation
of the first term of (61) is that $f(x) = (1 + \alpha x)^{-1}$ is a smooth function of $x$ in the region of support of
the eigenvalue spectrum. This is not true in the vicinity of $x = 0$ for arbitrarily large $\alpha$, specifically when
$\alpha = O(N)$. Thus when $\beta = 1$, in which case the asymptotic eigenvalue spectrum includes $x = 0$, the above
approximation is not valid. As a result, this method breaks down at $\beta = 1$.



From (42) we get the mean SINR for the ZF receiveij^ 

f a(l 



E[7f 



/3 + 1/iV) P < 1 
13 = 1 



(65) 



in agreement with |27j . Similarly, the second order moments can be obtained from 



Thus we get 



and 



yzf 

^11 



a lim 

ceo — ^o<3 



M 



7r""(«o);7"""(«o) 



a^l3(l-t3+l/N) 
M 





/3 < 1 
(3 = 1 



'12 



M2 



P < 1 



(66) 



(67) 



(68) 



While in the case of /? < 1 the covariance matrix SI is well-defined and positive-definite with eigenvalues 



Ai(E' 



'M2 '~ (3M 



Footnote: For simplicity we neglect the subleading terms in the following equalities, i.e., we omit $O(1/N^2)$ in (65) and $O(1/N^3)$ in
(67), (68), respectively.
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and 

' M M"^^ pM 

for all $k = 2, \ldots, M$, the case $\beta = 1$ is problematic. Specifically, it results in (narrowly) negative eigenvalues
for $\Sigma^{\rm zf}$, thereby invalidating the Gaussian approximation for the $\gamma_k$'s and, as a result, the further treatment of
the mutual information as a Gaussian variable. The case $\beta = 1$ for the ZF receiver is therefore excluded in
the subsequent Gaussian approximation of the mutual information.

5.3 Gaussian approximation and outage probability 

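In this section we use the previous results together with the asymptotic Gaussianity that follows from the
fact that higher-order moments are vanishing (see Appendix D) to give an explicit Gaussian approximation
for the outage probability of linear MMSE and ZF receivers with coding across the antennas in the regime
of fixed SNR and large number of antennas.

We start with the mean $\mathbb{E}[I_N]$. Due to the symmetry with respect to the terms $\gamma_k$, we have

\[
C_1 = \mathbb{E}[I_N] = M\, \mathbb{E}[\log(1 + \gamma_1)] = M\, \mathbb{E}\left[\log\left(1 + \mathbb{E}[\gamma_1] + \gamma_1 - \mathbb{E}[\gamma_1]\right)\right]
= M \log(1 + \mathbb{E}[\gamma_1]) + M \sum_{n \ge 2} \frac{(-1)^{n-1}}{n} \frac{\mathbb{E}\left[(\gamma_1 - \mathbb{E}[\gamma_1])^n\right]}{(1 + \mathbb{E}[\gamma_1])^n}. \tag{69}
\]

In the above expansion, all terms $n > 2$ involve cumulants of $\gamma_1$ higher than 2, and thus can be neglected. For
the MMSE receiver, this yields

\[
C_1^{\rm mmse} = M \log\left(1 + g_1^{\rm mmse}\left(\alpha, \beta - \tfrac{1}{N}\right)\right) - \frac{v_d^{\rm mmse}}{2\left(1 + g_1^{\rm mmse}\left(\alpha, \beta - \tfrac{1}{N}\right)\right)^2} + o(1)
\]
\[
\approx M \log\left(1 + g_1^{\rm mmse}(\alpha, \beta) - \frac{1}{N} \frac{\partial}{\partial \beta} g_1^{\rm mmse}(\alpha, \beta)\right) - \frac{v_d^{\rm mmse}}{2\left(1 + g_1^{\rm mmse}(\alpha, \beta)\right)^2}
\]
\[
\approx M \log\left(1 + g_1^{\rm mmse}(\alpha, \beta)\right) - \frac{\beta\, \frac{\partial}{\partial \beta} g_1^{\rm mmse}(\alpha, \beta)}{1 + g_1^{\rm mmse}(\alpha, \beta)} - \frac{v_d^{\rm mmse}}{2\left(1 + g_1^{\rm mmse}(\alpha, \beta)\right)^2}, \tag{70}
\]

where $g_1^{\rm mmse}(\alpha, \beta)$ and $v_d^{\rm mmse}$ are given by (41) and (49), respectively.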



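For the ZF receiver the mean is given by

\[
C_1^{\rm zf} = M \log(1 + \alpha(1 - \beta)) + \frac{\alpha\beta\left[1 + \frac{\alpha(1 - \beta)}{2}\right]}{(1 + \alpha(1 - \beta))^2} + o(1). \tag{71}
\]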
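Similarly, we can calculate the variance of the mutual information as follows

\[
C_2 = M\, \mathbb{E}_c[\log(1 + \gamma_1); \log(1 + \gamma_1)] + M(M - 1)\, \mathbb{E}_c[\log(1 + \gamma_1); \log(1 + \gamma_2)]
\]
\[
= M \sum_{m,n=1}^{\infty} \frac{(-1)^{m+n}}{mn} \frac{\mathbb{E}_c\left[(\gamma_1 - \mathbb{E}[\gamma_1])^m; (\gamma_1 - \mathbb{E}[\gamma_1])^n\right]}{(1 + \mathbb{E}[\gamma_1])^{m+n}}
+ M(M - 1) \sum_{m,n=1}^{\infty} \frac{(-1)^{m+n}}{mn} \frac{\mathbb{E}_c\left[(\gamma_1 - \mathbb{E}[\gamma_1])^m; (\gamma_2 - \mathbb{E}[\gamma_2])^n\right]}{(1 + \mathbb{E}[\gamma_1])^{m+n}}. \tag{72}
\]

As we see above, the first terms in both the above summations give the leading order of the variance of the
mutual information, which we denoted in (33) by $\sigma^2$. The variance of the MMSE mutual information is
given by

\[
C_2^{\rm mmse} = \frac{v_d^{\rm mmse} + v_{od}^{\rm mmse}}{\left(1 + g_1^{\rm mmse}(\alpha, \beta)\right)^2} + o(1), \tag{73}
\]

where $v_{od}^{\rm mmse}$ is given by (62). The corresponding variance for the ZF receiver is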

for P < 1. As mentioned above, the case /3 = 1 does not result in a well-behaved jointly Gaussian behavior 
of the 7fc's, and therefore the Gaussian approximation of the mutual information cannot be derived with 
this approach. 



As anticipated at the beginning of this section, from (70) and (71) we see that the mean mutual
information is expressed in the form $m_1 = M c_{10} + c_{11}$, where the coefficient $c_{10}$ was found in previous works
(e.g., [M]), considering the limit $N \to \infty$ of the normalized mutual information per transmit antenna, and
the term $c_{11}$ is a correction term that captures the correlation between the SINRs $\gamma_k$.

Under this Gaussian approximation, we can easily evaluate the outage probability for fixed SNR, $\beta$ and
number of antennas $M$ as follows:

\[
P_{\rm out}(R, \rho) \approx Q\left( \frac{C_1 - R}{\sqrt{C_2}} \right), \tag{75}
\]

where $Q(x) = \int_x^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{-t^2/2}\, dt$ is the Gaussian tail function.
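
The approximation (75) is straightforward to evaluate once $C_1$ and $C_2$ are available. The sketch below is illustrative only: instead of the closed-form cumulants, it plugs in the empirical mean and variance of simulated MMSE mutual information samples (an assumption made here to keep the example self-contained), with rates in nats and i.i.d. CN(0,1) channel entries. The helper names are hypothetical.

```python
import math
import numpy as np

def q_function(x):
    """Gaussian tail function Q(x), via the complementary error function."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def gaussian_outage(rate, c1, c2):
    """Gaussian approximation (75): P_out ~ Q((C1 - R) / sqrt(C2))."""
    return q_function((c1 - rate) / math.sqrt(c2))

def mmse_mi_nats(M, N, rho, trials, rng):
    """Samples of the MMSE mutual information sum_k log(1 + gamma_k), in nats."""
    out = np.empty(trials)
    for t in range(trials):
        H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2.0)
        G = np.eye(M) + (rho / M) * (H.conj().T @ H)
        out[t] = -np.sum(np.log(np.real(np.diag(np.linalg.inv(G)))))
    return out

rng = np.random.default_rng(0)
samples = mmse_mi_nats(M=8, N=16, rho=10.0, trials=4000, rng=rng)
# Empirical stand-ins for C1 and C2 (the paper gives closed forms instead)
c1_hat, c2_hat = float(samples.mean()), float(samples.var())
for shift in (-2.0, -1.0, 0.0):
    rate = c1_hat + shift * math.sqrt(c2_hat)
    empirical = float(np.mean(samples < rate))
    print(f"R={rate:.3f} nats: empirical={empirical:.4f}, Gaussian={gaussian_outage(rate, c1_hat, c2_hat):.4f}")
```

Comparing the two columns at rates a few standard deviations below the mean gives a quick impression of how well the Gaussian tail tracks the empirical outage at a given size.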

We conclude this section with a discussion on the range of validity of the Gaussian approximation. For
the MMSE receiver, in the large $\rho$ limit we have that $C_2^{\rm mmse} = O(\rho)$ for $\beta = 1$, while $C_2^{\rm mmse} = O(1)$ for
$\beta < 1$. This fast increase of the variance of the distribution for $\beta = 1$ and $\rho \gg 1$ is a spurious result in this
approximation, due to the neglected terms which are negligible for fixed $\rho$ and increasingly large $N$, but
become important for fixed $N$ and large $\rho$.

The behavior of the ZF receiver for $\beta = 1$, when the jointly Gaussian behavior of the $\gamma_k$'s breaks down,
exacerbates the above large $\rho$ behavior of the MMSE receiver. In fact, as was discussed before, the ZF case
is in some sense the "infinite $\rho$ limit" of the MMSE case. Therefore the problematic situation appearing in
the ZF receiver for $\beta = 1$ has the same roots as the problems faced in the large (but finite) $\rho$ limit when $N$
is also finite but not large enough.



5.4 Simulations and comparisons 

In this section we first validate the asymptotic analysis by comparing the asymptotic approximation for $C_1$
and $C_2$ with the exact moments obtained by finite-dimensional Monte Carlo simulation. We then compare
the Gaussian approximation to the outage probability with finite-dimensional Monte Carlo simulation.

For the sake of comparison, we also consider the outage probability of the optimal receiver, given by
the log-det cdf $P(I_N^{\rm opt} \le R)$, where $I_N^{\rm opt} = \log\det(I + \frac{\rho}{M} H H^H)$. As said in the Introduction, the asymptotic
Gaussianity of the log-det mutual information is well-known and holds under very general models of channel
correlation across the antennas (not considered in this work). For completeness, we recall its expressions
under the i.i.d. channel coefficient assumptions and in the notation of this paper. We have



AA(0,1) (76) 



where 



and 



p = log(l + gr'^{a, 13)) + i log(l - a(l - /3) + g^— (a, (3)) + " ^, (77) 



= -log I 1-^(1- -^ ---- ) 1. (78) 




The first term in (77) coincides with the coefficient of M in the first term of (70), i.e., it is the asymptotic 
capacity per antenna of the linear MMSE receiver. The additional terms in (77) represent the so-called 
"non-linear" gain of the optimal versus linear MMSE receiver, as discovered in [35] . 

It is worth pointing out that for large SNR, the asymptotic mean capacity is given by 

E[lX]-Mlogp (79) 

while the variance has the following behavior 

\ llogp 13=1 

Using the Gaussian approximation 
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with R = r log p, in the large p limit we find 



log Pont{R,p) ^ ( A/T \2 logp 

' ' V^*^ 2|log(l--/3)| P <- ^ 



logp -y'" ') 2|log(l~/3)| - /o.^ 



For $\beta = 1$, the Gaussian approximation yields an outage probability exponent equal to $(r - M)^2$ for $r \in [0, M]$,
that closely approximates the exact exponent $d^*(r)$ [3]. However, for $\beta < 1$ the Gaussian approximation
yields a completely inaccurate behavior. In fact, in this case the exponent obtained through (81) would be
infinite, while we know from [3] that $d^*(r) \le MN$. The reason for this spectacular failure of the Gaussian
approximation is that, in the large-$N$ approximation, it is implicitly assumed that for $\beta < 1$ the eigenvalue
distribution at very small eigenvalues is zero, as described by the Marcenko-Pastur law. Instead, for
large $\rho$ the eigenvalues that dominate the outage probability are exactly the very small ones, of the order of
$1/\rho$, i.e., exactly the ones that the Gaussian approximation neglects.

We conclude this section by presenting some numerical results. Fig. 5 compares the analytical mean
of the MMSE mutual information per antenna $C_1^{\rm mmse}/M$ with the corresponding empirical mean obtained
from Monte Carlo simulation. The corresponding comparison between the analytical variance $C_2^{\rm mmse}$ and the
empirical variance is presented in Fig. 6. Using the results for the mean and the variance, we plot the CDF
of the (Gaussian) mutual information for the MMSE and optimal receiver in Figs. 7 and 8. Both analytical and
empirical results are plotted, for a wide range of $M$ and SNRs. For brevity, we have only reported plots
of the CDF for the mutual information for the ZF case, see Fig. 9. The plots are for $M = 3, 10$, $\beta = 0.5$
and $\rho = 3, 30$ dB. The results are similar in flavor to the MMSE case.

We notice that the analytical and empirical results match closely, even for a moderate number of antennas
and not too large SNRs, in line with the comments made earlier regarding the validity of the analysis. It
is also worth noticing that the accuracy of the Gaussian approximation for linear receivers appears to
be slightly inferior to that of the Gaussian approximation for the optimal receiver case, especially for very
small $N$ and large SNR.



6 Conclusions 

Novel wireless communication systems are targeting very large spectral efficiencies and will operate at high 
SNR thanks to hot-spots and pico-cell arrangements. For example, a system with bandwidth of 20 MHz 
and operating at 100 Mb/s requires a spectral efficiency of 5 bit/s/Hz, corresponding to coding rate R = 5 
bpcu in notation adopted here, if one neglects non-ideal effects such as pilot symbols, guard band and 
guard intervals, cyclic prefix redundancy for OFDM, etc. For such systems, the use of low-complexity 
linear receivers in a separated detection and decoding architecture as those examined in this paper may be 
mandatory because of complexity and power consumption. 
[Figure 5 plot; panels: M = 2, M = 5, M = 10, M = 20; curve labels: SNR = 3 dB, SNR = 10 dB, SNR = 30 dB.]

Figure 5: Mean of the MMSE mutual information per antenna ($C_1^{\rm mmse}/M$) as a function of $\beta$, for M = 2, 5, 10
and 20. The solid lines are analytical results, and the corresponding dash-dot lines are empirical results
obtained from Monte Carlo simulation. Diamonds denote 3 dB, circles 10 dB and triangles 30 dB.

Figure 6: Variance of the MMSE mutual information ($C_2^{\rm mmse}$) as a function of $\beta$, for M = 2, 5, 10 and 20.
The solid lines are analytical results, and the corresponding dash-dot lines are empirical results obtained
from Monte Carlo simulation. Diamonds denote 3 dB, circles 10 dB and triangles 30 dB.
[Figure 7 plot; panels: M = 2, N = 4, SNR = 3 dB and M = 2, N = 4, SNR = 30 dB; horizontal axes: MI (nats).]

Figure 7: CDF of the mutual information (MI) for the MMSE and optimal receivers, for M = 2, 3, $\beta$ = 0.5
and $\rho$ = 3, 30 dB. The solid blue line is the analytical result for the MMSE receiver, the dot-dash blue is MMSE
empirical, the dashed black is the optimal receiver analytical and the dotted black is the optimal receiver
empirical.
[Figure 8 plot; panels: M = 5, N = 10, SNR = 3 dB and M = 5, N = 10, SNR = 30 dB.]

Figure 8: CDF of the mutual information (MI) for the MMSE and optimal receivers, for M = 5, 10, $\beta$ = 0.5
and $\rho$ = 3, 30 dB. The solid blue line is the analytical result for the MMSE receiver, the dot-dash blue is MMSE
empirical, the dashed black is the optimal receiver analytical and the dotted black is the optimal receiver
empirical.
[Figure 9 plot; panels: CDF of the MI, M = 3, N = 6, SNR = 3 dB; M = 3, N = 6, SNR = 30 dB; M = 10, N = 20, SNR = 30 dB.]

Figure 9: CDF of the mutual information (MI) for the ZF and optimal receivers, for M = 3, 10, $\beta$ = 0.5 and
$\rho$ = 3, 30 dB. The solid blue line is the analytical result for the ZF receiver, the dot-dash blue is ZF empirical,
the dashed black is the optimal receiver analytical and the dotted black is the optimal receiver empirical.
In this paper we investigated the asymptotic performance of such separated linear detection and decoding
architectures in two relevant asymptotic regimes. In the regime of fixed number of antennas and increasing
SNR and coding rate, we showed that linear detection may be very suboptimal. Furthermore, due to the
strong correlation between the SINRs of the parallel channels induced by the linear receiver, coding across 
the antennas does not help in terms of the achievable Diversity-Multiplexing Tradeoff. We also illuminated 
the very peculiar behavior of the linear MMSE receiver with coding across the antennas, that exhibits 
a diversity order (slope of the outage probability curve) that changes depending on the rate. Then, we 
analyzed the asymptotic behavior of the linear MMSE and the ZF receivers with coding across the antennas 
in the regime of fixed SNR and large (but finite) number of antennas. We showed that the corresponding 
mutual information has statistical fluctuations that converge in distribution to a Gaussian random variable, 
and we computed its mean and variance in closed form. This yields a simple Gaussian approximation of the 
outage probability in this asymptotic regime, within the limitations that have been thoroughly discussed. 

Based on the analysis carried out in this work, we may summarize some considerations on system design. 
In order to achieve a required target spectral efficiency at given block-error rate and SNR operating point, 
an attractive design option may consist of increasing the number of antennas (especially at the receiver)
and using a low-complexity linear receiver. However, pure spatial multiplexing (independent coded streams 
directly fed into the transmit antennas) and/or linear ZF receivers should be avoided. In contrast, coding 
across antennas and a linear MMSE receiver can achieve a very good tradeoff between performance and 
complexity in a wide range of system operating points. 






APPENDIX 



A Proof of P{A) = 0(1) 



Our aim is to provide a lower-bound to the quantity 



It is well known that $U$ and $\Lambda$ are independent random matrices and that $U$ is Haar distributed (i.e., is
an isotropically random unitary matrix distributed uniformly over the Stiefel manifold). Therefore, the
vector $u_1 = (u_{1,1}, u_{1,2}, \ldots, u_{1,M})$ corresponding to the first row of $U$ is uniformly distributed on the unit
$M$-dimensional hypersphere (or $M$-sphere) $O_M(0, 1)$ and satisfies $|u_1|^2 = \sum_{i=1}^{M} |u_{1,i}|^2 = 1$. The point

is a point on the unit M-sphere. Consider the spherical cap C of the unit sphere 



P 



1 



1 



that is cut out by the sphere ©^/(p, e), where e is a small positive number (See Fig. 10) 




Figure 10: The unit hemisphere and a spherical cap. 



'^We will use the notation Om{c, 5) to denote an M-sphere centred at c with radius 5. 
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The coordinates of any point u in this spherical cap C is lower bounded by 

1 

Uj > —= -€, V 7 = 1, . . . , M. 

Therefore, 

2 



Defining the constant $c = \left(\frac{1}{\sqrt{M}} - \epsilon\right)^{2} > 0$, we have that 



$$P(A) \ge P(\mathbf{u}_1 \in \mathcal{C}) = \frac{\text{Surface area of } \mathcal{C}}{\text{Surface area of } \mathcal{O}_M(0,1)},$$

where the latter equality holds since $\mathbf{u}_1$ is isotropic. In order to compute the above surface areas, consider 
Fig. 10. The surface area of $\mathcal{C}$, denoted by $S_{\mathcal{C}}(\phi)$, is given in [2], 
and the surface area of a unit M-sphere is [2] $S_M(1) = M\pi^{M/2}/\Gamma(M/2 + 1)$. All that remains is to compute 
the angle $\phi$, which is accomplished by solving the following equations obtained from the two right-angled 
triangles in Fig. 10: 

$$h^2 + a^2 = \epsilon^2,$$

$$a^2 + (1-h)^2 = 1.$$



Solving for a, h, we obtain 



Therefore, 



tan' 



and 



as desired. 
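The ratio of surface areas used in the bound above can be checked numerically. The following is a minimal sketch, assuming the real unit sphere in $\mathbb{R}^M$ and the standard polar-angle integral for the area of a spherical cap; the function names are hypothetical and the half-angle is obtained from the chord relation $\phi = 2\arcsin(\epsilon/2)$, which is equivalent to solving the two triangle equations of Fig. 10.

```python
import math

def sphere_area(M):
    """Surface area of the unit sphere in R^M: M * pi^(M/2) / Gamma(M/2 + 1)."""
    return M * math.pi ** (M / 2) / math.gamma(M / 2 + 1)

def cap_half_angle(eps):
    """Half-angle of the cap cut out by a sphere of radius eps centred at a
    point of the unit sphere (a chord of length eps subtends this angle)."""
    return 2.0 * math.asin(eps / 2.0)

def cap_area(M, phi, steps=20000):
    """Area of the cap of half-angle phi on the unit sphere in R^M (M >= 2).

    Evaluates A_{M-1} * int_0^phi sin(t)^(M-2) dt with the midpoint rule,
    where A_{M-1} is the surface area of the unit sphere in R^(M-1).
    """
    h = phi / steps
    integral = sum(math.sin((i + 0.5) * h) ** (M - 2) for i in range(steps)) * h
    return sphere_area(M - 1) * integral

def cap_fraction(M, eps):
    """Probability that an isotropic unit vector in R^M falls in the cap."""
    return cap_area(M, cap_half_angle(eps)) / sphere_area(M)

if __name__ == "__main__":
    for M in (3, 4, 8):
        print(M, cap_fraction(M, 0.5))
```

For any fixed $M$ and $\epsilon > 0$ the fraction is a strictly positive constant that does not depend on the SNR, which is the only property the argument of this appendix requires.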
B Novikov's Theorem 



We introduce here a useful trick which makes the connection between Gaussian integration and differentiation 
over Gaussian random variables. 



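Theorem 2 (Novikov) Let $\mathbf{H}$ be an $N \times M$ matrix with i.i.d. elements drawn from $\mathcal{CN}(0, 1/N)$. Let 
$f(\mathbf{H}, \mathbf{H}^H)$ be a scalar function of the matrix elements of $\mathbf{H}$ and their complex conjugates, which does not 
grow faster than the inverse of the probability density of $\mathbf{H}$, i.e., such that 

$$\lim_{|h_{ij}| \to \infty} p(\mathbf{H})\, f(\mathbf{H}, \mathbf{H}^H) = 0 \qquad (82)$$

for all matrix elements $h_{ij}$. Then, for any set of indices $i, j$, the following relation holds 

$$E\left[h_{ij}\, f(\mathbf{H}, \mathbf{H}^H)\right] = \frac{1}{N}\, E\left[\frac{\partial f(\mathbf{H}, \mathbf{H}^H)}{\partial h_{ij}^{*}}\right] \qquad (83)$$

where $h_{ij}$ and $h_{ij}^{*}$ are to be treated as individual variables in the differentiation. 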



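Sketch of the Proof. Even though the general proof is involved, we present here a simple proof for a 
single real Gaussian variable $z \sim \mathcal{N}(0, 1)$ by integrating by parts: 

$$E[z f(z)] = \int_{-\infty}^{\infty} \frac{dz}{\sqrt{2\pi}}\, z f(z)\, e^{-z^2/2} = \left. -f(z)\, \frac{e^{-z^2/2}}{\sqrt{2\pi}} \right|_{-\infty}^{\infty} + \int_{-\infty}^{\infty} \frac{dz}{\sqrt{2\pi}}\, f'(z)\, e^{-z^2/2} = E[f'(z)]. \qquad (84)$$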



□ 
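The real-variable identity in the proof sketch, $E[z f(z)] = E[f'(z)]$, is easy to verify numerically. The following is a small sketch under stated assumptions: the test function $f(z) = \sin z + z^3$ and the helper name `gaussian_expectation` are illustrative choices, and the expectation is approximated by the trapezoidal rule on a truncated interval.

```python
import math

def gaussian_expectation(g, lo=-12.0, hi=12.0, steps=48000):
    """Approximate E[g(z)] for z ~ N(0, 1) with the composite trapezoidal rule."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        z = lo + i * h
        weight = 0.5 if i in (0, steps) else 1.0  # end points count half
        total += weight * g(z) * math.exp(-0.5 * z * z)
    return total * h / math.sqrt(2.0 * math.pi)

# Illustrative test function and its derivative (not taken from the paper).
def f(z):
    return math.sin(z) + z ** 3

def f_prime(z):
    return math.cos(z) + 3.0 * z ** 2

lhs = gaussian_expectation(lambda z: z * f(z))  # E[z f(z)]
rhs = gaussian_expectation(f_prime)             # E[f'(z)]

if __name__ == "__main__":
    print(lhs, rhs)
```

For this choice of $f$ both sides equal $3 + e^{-1/2}$, since $E[z^4] = 3$ and $E[z \sin z] = E[\cos z] = e^{-1/2}$. The boundary term in the integration by parts vanishes because $f$ grows only polynomially.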

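Example. A useful and illustrative example of the application of Novikov's Theorem is the evaluation of the 
fourth moment of an i.i.d. complex Gaussian vector with elements drawn from $\mathcal{CN}(0, 1/N)$. We have 

$$E\left[h_i^{*} h_j h_k^{*} h_l\right] = \frac{1}{N}\, E\left[\frac{\partial}{\partial h_i}\left(h_j h_k^{*} h_l\right)\right] = \frac{\delta_{ij}}{N}\, E\left[h_k^{*} h_l\right] + \frac{\delta_{il}}{N}\, E\left[h_k^{*} h_j\right] = \frac{\delta_{ij}\delta_{kl} + \delta_{il}\delta_{kj}}{N^2}. \qquad (85)$$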
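The fourth-moment formula of this example can be checked by simulation. The sketch below is a Monte Carlo illustration only: the helper names, the vector length `dim`, the number of trials, and the seed are all hypothetical choices, and the comparison is accurate only up to sampling noise.

```python
import random

def sample_cn(variance, rng):
    """Draw one circularly symmetric complex Gaussian sample CN(0, variance)."""
    s = (variance / 2.0) ** 0.5  # real and imaginary parts each carry half the variance
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))

def novikov_fourth_moment(i, j, k, l, N):
    """Closed form (delta_ij delta_kl + delta_il delta_kj) / N^2."""
    delta = lambda a, b: 1.0 if a == b else 0.0
    return (delta(i, j) * delta(k, l) + delta(i, l) * delta(k, j)) / N ** 2

def empirical_fourth_moment(i, j, k, l, N, dim=2, trials=100000, seed=0):
    """Sample average of conj(h_i) h_j conj(h_k) h_l over i.i.d. CN(0, 1/N) vectors."""
    rng = random.Random(seed)
    acc = 0.0 + 0.0j
    for _ in range(trials):
        h = [sample_cn(1.0 / N, rng) for _ in range(dim)]
        acc += h[i].conjugate() * h[j] * h[k].conjugate() * h[l]
    return acc / trials

if __name__ == "__main__":
    N = 4
    for idx in [(0, 0, 0, 0), (0, 0, 1, 1), (0, 1, 0, 1)]:
        print(idx, empirical_fourth_moment(*idx, N), novikov_fourth_moment(*idx, N))
```

The index pattern $(0,1,0,1)$ is included because the closed form predicts zero there: neither Kronecker-delta product is non-zero when $i \ne j$ and $i \ne l$.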



This means that $\partial h_{ij} / \partial h_{ij}^{*} = 0$ when computing the partial derivative in (83). 






C Fluctuations of eigenvalues 

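Let 

$$\mathbf{B} = \left[\mathbf{I} + \alpha \mathbf{H}\mathbf{H}^H\right]^{-1} \qquad (86)$$

where $\mathbf{H} \in \mathbb{C}^{N \times M}$ has i.i.d. elements $\sim \mathcal{CN}(0, 1/N)$, and let $\beta = M/N$. The normalized trace of $\mathbf{B}$ is given by 

$$\frac{1}{N}\mathrm{Tr}(\mathbf{B}) = \frac{1}{N}\sum_{k=1}^{N} \frac{1}{1 + \alpha\lambda_k(\mathbf{H}\mathbf{H}^H)} = \eta^{(N)}(\alpha)$$

where $\eta^{(N)}(\alpha)$ is the $\eta$-transform [37] of the empirical eigenvalue distribution of $\mathbf{H}\mathbf{H}^H$. For large $N$, the 
variance of $\mathrm{Tr}(\mathbf{B}) = N\eta^{(N)}(\alpha)$ is of order unity and can be calculated in closed form [38], [39]: 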



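$$E_c\left[\mathrm{Tr}(\mathbf{B}); \mathrm{Tr}(\mathbf{B})\right] = \mathcal{P}\int_{\lambda_{\min}}^{\lambda_{\max}} d\lambda \int_{\lambda_{\min}}^{\lambda_{\max}} d\mu\; \mathcal{K}_2(\lambda, \mu)\, \frac{1}{(1 + \alpha\lambda)(1 + \alpha\mu)} \qquad (87)$$

where $\mathcal{P}$ denotes the principal part of the integral, $\lambda_{\min,\max}$ denote the extremal values of the support of 
the Marcenko-Pastur law [37], given by 

$$\lambda_{\min,\max} = \left(1 \mp \sqrt{\beta}\right)^2 \qquad (88)$$

and $\mathcal{K}_2(\lambda, \mu)$ is an integral kernel representing the deviation of the joint distribution of the 
eigenvalues $\lambda, \mu$ from the product of their marginal distributions, and is given asymptotically by 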



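$$\mathcal{K}_2(\lambda, \mu) = -\frac{1}{2\pi^2}\, \frac{1}{\sqrt{(\lambda - \lambda_{\min})(\lambda_{\max} - \lambda)}}\, \frac{\partial}{\partial \mu}\left[\frac{\sqrt{(\mu - \lambda_{\min})(\lambda_{\max} - \mu)}}{\lambda - \mu}\right] \qquad (89)$$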



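It can be verified that the above function is symmetric in $\lambda$ and $\mu$. After integration by parts, we get 

$$E_c\left[\mathrm{Tr}(\mathbf{B}); \mathrm{Tr}(\mathbf{B})\right] = \frac{1}{2\pi^2}\, \mathcal{P}\int_{\lambda_{\min}}^{\lambda_{\max}} d\lambda \int_{\lambda_{\min}}^{\lambda_{\max}} d\mu \left[\frac{(\mu - \lambda_{\min})(\lambda_{\max} - \mu)}{(\lambda - \lambda_{\min})(\lambda_{\max} - \lambda)}\right]^{1/2} \frac{\alpha}{(\mu - \lambda)(1 + \alpha\lambda)(1 + \alpha\mu)^2}$$

$$= \frac{\alpha^2 \beta}{\left(1 + 2\alpha(1 + \beta) + \alpha^2(1 - \beta)^2\right)^2} = O(1). \qquad (90)$$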



The result (90) is used in the calculation of the correlations of $\gamma_1, \gamma_2$ in (61). 
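As a numerical companion to the normalized trace defined at the start of this appendix, the sketch below evaluates the large-$N$ limit of $\frac{1}{N}\mathrm{Tr}(\mathbf{B})$. It assumes the standard closed form of the $\eta$-transform of the Marcenko-Pastur law as tabulated in references such as [37]; that formula is recalled here and is not stated in this appendix, and the function names are hypothetical. As a self-check, the closed form is compared against the fixed-point relation $\eta = 1 - \beta + \beta/(1 + \alpha\eta)$, which the same limit is known to satisfy.

```python
import math

def mp_support(beta):
    """Edges of the Marcenko-Pastur support for H H^H, with beta = M / N."""
    return (1.0 - math.sqrt(beta)) ** 2, (1.0 + math.sqrt(beta)) ** 2

def eta_mp(alpha, beta):
    """Assumed large-N limit of (1/N) Tr (I + alpha H H^H)^{-1}."""
    lam_min, lam_max = mp_support(beta)
    F = (math.sqrt(1.0 + alpha * lam_max) - math.sqrt(1.0 + alpha * lam_min)) ** 2
    return 1.0 - F / (4.0 * alpha)

def fixed_point_residual(alpha, beta):
    """Residual of eta = 1 - beta + beta / (1 + alpha * eta); zero if consistent."""
    eta = eta_mp(alpha, beta)
    return eta - (1.0 - beta + beta / (1.0 + alpha * eta))

if __name__ == "__main__":
    for alpha, beta in [(0.1, 0.5), (1.0, 1.0), (10.0, 2.0)]:
        print(alpha, beta, eta_mp(alpha, beta), fixed_point_residual(alpha, beta))
```

Note that $(1 + \alpha\lambda_{\min})(1 + \alpha\lambda_{\max}) = 1 + 2\alpha(1+\beta) + \alpha^2(1-\beta)^2$, which is the combination that appears in the denominator of (90).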

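Furthermore, in [38], [39] two important and more general results are shown. In particular, for any 
functions $f_1(x)$, $f_2(x)$ the following result is true in the limit of large $N$ 

$$E_c\left[\mathrm{Tr}(f_1(\mathbf{B})); \mathrm{Tr}(f_2(\mathbf{B}))\right] = \mathcal{P}\int d\lambda \int d\mu\; \mathcal{K}_2(\lambda, \mu)\, f_1(\lambda)\, f_2(\mu) = O(1) \qquad (91)$$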



as long as these functions are bounded and smooth enough within the support of the asymptotic eigenvalue 
spectrum (for example, $f(x) = \alpha/(1 + \alpha x)^2$ is smooth). 
Also, in [38], [39] it is shown that all higher order cumulants of such smooth functions vanish in the large 
$N$ limit, i.e., for $n > 2$ 

$$E_c\left[\mathrm{Tr}(f_1(\mathbf{H}\mathbf{H}^H)); \ldots; \mathrm{Tr}(f_n(\mathbf{H}\mathbf{H}^H))\right] = Q_n(N) = o(1) \qquad (92)$$

where we have denoted the (arbitrary for our purposes) scaling of the above cumulant moment with $N$ as 
$Q_n(N)$ for future reference. We will use this result in Appendix D to prove that all cumulant moments of 
$I_M$ of order higher than two vanish for large $N$. 



D Higher order cumulants are vanishing 



After the calculation of the mean (70) and the variance (72) of the mutual information in Section 5, we now 
need to show that the higher order cumulants of the mutual information vanish in the limit of large $N$. 
This will conclude the proof of the asymptotic Gaussianity of the mutual information, as discussed in the 
beginning of Section 5. 



We need to show that the cumulant moments defined in (36) as 

$$C_n = \sum_{k_1, \ldots, k_n = 1}^{M} E_c\left[\log(1 + \gamma_{k_1}); \ldots; \log(1 + \gamma_{k_n})\right]$$

vanish for $n > 2$ when $N \to \infty$. Despite the fact that $C_n$ is a sum of $O(N^n)$ terms, we shall show that it is 
in fact of order $o(1)$. 



D.1 MMSE receiver higher order cumulants 

We discuss in some detail the case of the MMSE receiver. At the end of this appendix, a short argument is 
given in order to reach the same conclusions for the ZF case. 

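While a formal proof would be lengthy and tedious and would not add much value to the paper, we shall 
sketch the basic steps of the proof, leaving out several technicalities. We start by recalling that 
each $\gamma_k$ in the above sum is defined as $\gamma_k = \alpha \mathbf{h}_k^H \mathbf{B}_k \mathbf{h}_k$, where $\mathbf{B}_k = \left(\mathbf{I} + \alpha\mathbf{H}\mathbf{H}^H - \alpha\mathbf{h}_k\mathbf{h}_k^H\right)^{-1}$. Using 
the matrix inversion lemma, we see that the denominator in the expression of $\mathbf{B}_k$ includes all columns of 
$\mathbf{H}$ other than $\mathbf{h}_k$. For every n-tuple $\{k_1, k_2, \ldots, k_n\}$ we define the matrix $\mathbf{B}_{\{k_i\}}$, such that 

$$\mathbf{B}_{\{k_i\}} = \left(\mathbf{I} + \alpha \mathbf{H}_{\{k_i\}} \mathbf{H}_{\{k_i\}}^H\right)^{-1},$$

where $\mathbf{H}_{\{k_i\}}$ is obtained by removing columns $\mathbf{h}_{k_1}, \mathbf{h}_{k_2}, \ldots, \mathbf{h}_{k_n}$ from $\mathbf{H}$. If for some $i \ne j$ we have $k_i = k_j$, 
then we only remove column $k_i$ once. For any finite $n$, the following approximation holds 

$$\mathbf{B}_{k_j} = \mathbf{B}_{\{k_i\}} + o(1), \qquad (93)$$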
in the sense that the elements of the matrix $N(\mathbf{B}_{k_j} - \mathbf{B}_{\{k_i\}})$ are almost surely finite in the limit of large 
$N$. Roughly speaking, since the elements of $\mathbf{H}$ are zero-mean Gaussian with variance $1/N$, the difference 
$\mathbf{B}_{k_j}^{-1} - \mathbf{B}_{\{k_i\}}^{-1} = \alpha \sum_{k_i \ne k_j} \mathbf{h}_{k_i}\mathbf{h}_{k_i}^H$ adds a term of order $O(1/N)$ to each element of $\mathbf{B}$, which can be neglected 
for large $N$. For example, the above approximation can be proved in an iterative fashion, by showing that 
$\mathbf{B}_{k_1} - \mathbf{B}_{k_1,k_2}$ is $O(1/N)$, then writing $\mathbf{B}_{k_1} = (\mathbf{B}_{k_1} - \mathbf{B}_{k_1,k_2}) + (\mathbf{B}_{k_1,k_2} - \mathbf{B}_{k_1,k_2,k_3}) + \cdots + \mathbf{B}_{\{k_i\}}$ and then showing 
that $\mathbf{B}_{k_1,k_2} - \mathbf{B}_{k_1,k_2,k_3}$ is also $O(1/N)$, etc. Each such difference can be shown to be almost surely $O(1/N)$ 
by applying the matrix inversion lemma and observing that the elements of $\mathbf{h}_{k_i}$ are $\mathcal{CN}(0, 1/N)$. 
As a result, to leading order in $N$ we have 

$$C_n \approx \sum_{\{k_i\}} E_c\left[\log\left(1 + \alpha \mathbf{h}_{k_1}^H \mathbf{B}_{\{k_i\}} \mathbf{h}_{k_1}\right); \ldots; \log\left(1 + \alpha \mathbf{h}_{k_n}^H \mathbf{B}_{\{k_i\}} \mathbf{h}_{k_n}\right)\right]$$

where $\sum_{\{k_i\}}$ is the sum over all possible $1 \le k_1, \ldots, k_n \le M$. 

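Since now the random vectors $\mathbf{h}_{k_1}, \ldots, \mathbf{h}_{k_n}$ do not appear in $\mathbf{B}_{\{k_i\}}$, we can explicitly average over them 
in the above expression. At this point it is convenient to expand the logarithms in Taylor series, such that 
the sum above becomes 

$$(-1)^n \sum_{l_1, \ldots, l_n = 1}^{\infty} \frac{(-\alpha)^{l_1 + \cdots + l_n}}{l_1 \cdots l_n} \sum_{\{k_i\}} E_c\left[\left(\mathbf{h}_{k_1}^H \mathbf{B}_{\{k_i\}} \mathbf{h}_{k_1}\right)^{l_1}; \ldots; \left(\mathbf{h}_{k_n}^H \mathbf{B}_{\{k_i\}} \mathbf{h}_{k_n}\right)^{l_n}\right] \qquad (94)$$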



We may now apply Novikov's Theorem to average over the $\mathbf{h}_{k_i}$'s. This is in general a formidable exercise 
in combinatorics. Instead, we only need to find how the leading terms scale with $N$. Specifically, in the 
following we will fix the n-tuple $l_1, l_2, \ldots, l_n$ and show that the corresponding term of the type $\sum_{\{k_i\}} E_c[\cdot]$ 
is $o(1)$ as $N \to \infty$. 

We first notice that since there are $L_{\{l_i\}} = l_1 + \ldots + l_n$ pairs of $\mathbf{h}^H$'s and $\mathbf{h}$'s, we will have an overall 
factor of $N^{-L_{\{l_i\}}}$ after averaging over all $\mathbf{h}$'s. Also, we can decompose the sum over $\{k_i\}$ into subsets or 
"shells", where each shell has the same number of distinct indices $k_i$'s. For example, there are $M!/(M-n)! = O(N^n)$ 
terms containing all distinct indices, and $nM!/((M-n+1)!(n-1)!) = O(N^{n-1})$ terms having $n-1$ distinct 
indices and one repeated index. In general, the shell with $q$ distinct indices contains $O(N^q)$ terms. 
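The shell decomposition can be illustrated by brute-force enumeration for small $M$ and $n$. The sketch below counts ordered index tuples $(k_1, \ldots, k_n)$ by their number of distinct entries and compares the result with the count $S(n, q)\, M!/(M-q)!$, where $S(n, q)$ is a Stirling number of the second kind. This counting convention (ordered tuples) is an assumption of the sketch, so its prefactors need not coincide with those quoted in the text; only the $O(M^q)$ growth of each shell matters for the argument. All function names are hypothetical.

```python
from collections import Counter
from itertools import product
from math import comb, factorial

def stirling2(n, q):
    """Stirling number of the second kind, via inclusion-exclusion."""
    total = sum((-1) ** j * comb(q, j) * (q - j) ** n for j in range(q + 1))
    return total // factorial(q)

def shell_sizes_bruteforce(M, n):
    """Map q -> number of ordered n-tuples over {0..M-1} with exactly q distinct entries."""
    return Counter(len(set(t)) for t in product(range(M), repeat=n))

def shell_size_formula(M, n, q):
    """S(n, q) ways to group positions, times M!/(M-q)! ways to assign distinct values."""
    return stirling2(n, q) * factorial(M) // factorial(M - q)

if __name__ == "__main__":
    M, n = 6, 4
    brute = shell_sizes_bruteforce(M, n)
    for q in range(1, n + 1):
        print(q, brute[q], shell_size_formula(M, n, q))
```

Since $M!/(M-q)!$ is a polynomial of degree $q$ in $M$ and $M = \beta N$, each shell with $q$ distinct indices indeed contains $O(N^q)$ terms for fixed $n$.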

If the $k_i$'s are all distinct, then the resulting cumulant moment will be of order $n$. Possible terms that 
may appear include, for example, 

$$E_c\left[\mathrm{Tr}\left[\mathbf{B}_{\{k_i\}}^{l_1}\right]; \ldots; \mathrm{Tr}\left[\mathbf{B}_{\{k_i\}}^{l_n}\right]\right] \qquad (95)$$

$$E_c\left[\mathrm{Tr}\left[\mathbf{B}_{\{k_i\}}^{l_1 - 1}\right] \mathrm{Tr}\left[\mathbf{B}_{\{k_i\}}\right]; \ldots; \mathrm{Tr}\left[\mathbf{B}_{\{k_i\}}^{l_n}\right]\right]$$

where a term as in the second line can only appear for $l_1 \ge 2$. Therefore, after averaging over the $\mathbf{h}_{k_i}$'s we are 
left with order-n moments, having as arguments products of the random variables $\mathrm{Tr}\,\mathbf{B}_{\{k_i\}}^{m_j}$, for $1 \le m_j \le l_j$, 
with $j = 1, \ldots, n$. Let $x_j$ be the total number of traces appearing in the j-th argument of a particular term, 
such that $x_j \le l_j$. For example, in the first line of (95), we have $x_1 = 1$, while in the second line we have 
$x_1 = 2$. Since generally $x_i \ge 1$, such cumulant moments can be reduced to sums of products of irreducible 
moments with respect to these random variables. To estimate the leading scaling in $N$ of these reducible 
moments, we recall that the first moment of the trace is $E_c[\mathrm{Tr}\,\mathbf{B}_{\{k_i\}}^{m}] = O(N)$, the second cumulant moment 
is $O(1)$ (91), while all higher cumulant moments are $o(1)$ (92). Let the leading term in the expansion of 
the reducible moments into irreducible ones have $d_1$ cumulants of order one, $d_2$ cumulants of order 2, $d_3$ 
cumulants of order 3, etc. The only constraint we need to impose is that 

$$d_1 \le d_{1\max} = \sum_{i=1}^{n} (x_i - 1),$$



which is valid due to the shift-invariance of irreducible cumulant moments of order higher than one. In the 
case that $d_1 = d_{1\max}$, we need to have $d_n = 1$ and $d_k = 0$ for all $k \ne 1, n$. Collecting all powers of $N$, this 
term will be of order $N^{n - L_{\{l_i\}} + d_{1\max}}\, Q_n(N) \le Q_n(N) = o(1)$, where we recall that the factor $N^n$ is due to 
the $O(N^n)$ possible combinations of the distinct $k_i$'s that appear in the corresponding sum in (94), while 
the factor $N^{-L_{\{l_i\}}}$ comes from the averages over the $\mathbf{h}_{k_i}$. Otherwise, when $d_1 < d_{1\max}$, the leading term 
will have, if that is at all possible, $d_2 = (\sum_i x_i - d_1)/2$ and $d_k = 0$ for $k > 2$. In this case, the scaling of this 
term with $N$ is $N^{n - L_{\{l_i\}} + d_1} \le N^{-1} = o(1)$. 

The above argument can be extended to the case when there are $q < n$ distinct $k_i$'s. The difference is that 
now additional terms may appear, since Novikov's formula may give derivatives across different arguments 
of the cumulant moment, thus reducing the order of the cumulant, e.g. if $k_1 = k_2$ we will get the term 



Kc[TY [B 



Tr [B 



Tr 



B 



{k^} 



;TV[B 



;TV[B 



(96) 



In general, for the shell with $q < n$ distinct indices the resulting terms will include cumulant moments with 
orders $m$, such that $q \le m \le n$. When $m = n$, we can directly apply the argument used when $q = n$, only 
replacing the number of possible combinations of the distinct indices from $N^n$ to $N^q$. 

In the case that $m < n$, when one expresses the reducible cumulant moments in terms of sums over 
products of irreducible ones, the maximum number of order one cumulants that may appear is now bounded 
by 

$$d_1 \le d'_{1\max} = \sum_{i=1}^{m} [x_i - 1]^{+} < L_{\{l_i\}} - q.$$

The crucial difference is that $d'_{1\max} < L_{\{l_i\}} - q$, which is due to the fact that, as seen in (96), in order 
to reduce the order of the cumulant moment to $m < n$, one needs to produce traces that span different 
arguments of the original cumulant moments. In this case, the leading term will be of order $N^{q - L_{\{l_i\}} + d'_{1\max}} \le N^{-1} = o(1)$. 

Following the above argument, we can show that all cumulant moments of the mutual information with 
order $n > 2$ are negligible in the limit $N \to \infty$. 



As far as the ZF receiver is concerned, recall that (see Section 5.2.2) we can obtain the SINR of the 
virtual channels of the ZF receiver by setting the parameter $\alpha_0$ inside the corresponding SINR expression of 
the MMSE receiver to $\infty$. Specifically, expressing the relation in terms of the matrices $\mathbf{B}_k(\alpha_0)$, where we have 
explicitly denoted the dependence on $\alpha_0$, the SINR of the k-th virtual channel of the ZF receiver is 

$$\alpha \lim_{\alpha_0 \to \infty} \mathbf{h}_k^H \mathbf{B}_k(\alpha_0) \mathbf{h}_k = \alpha \lim_{\alpha_0 \to \infty} \mathbf{h}_k^H \left[\mathbf{I} + \alpha_0 \mathbf{H}_k \mathbf{H}_k^H\right]^{-1} \mathbf{h}_k. \qquad (97)$$

As mentioned in Section 5.2.2, the above analysis involving the matrices $\mathbf{B}_{\{k_i\}}$ etc. can be carried out in the 
case of zero-forcing if $\beta < 1$. In addition, as seen in Appendix C, for $\beta < 1$ all n-th order cumulant moments 
of traces of products of $\mathbf{B}_{\{k_i\}}$ are given by (91) and (92). As a result, all finite $n > 2$ order cumulant 
moments of the mutual information are $o(1)$ for the ZF receiver too. 






References 

[1] Draft Standardization Document: IEEE P802.11n/D2.00, February 2007. 

[2] Claude E. Shannon, "Probability of Error for Optimal Codes in a Gaussian Channel," The Bell System 
Technical Journal, vol. 38, no. 3, pp. 611 - 656, May 1959. 

[3] L. Zheng and D. Tse, "Diversity and multiplexing: A fundamental tradeoff in multiple-antenna channels," 
IEEE Trans. Info. Theory, vol. 49, no. 5, pp. 1073-1096, May 2003. 

[4] I. E. Telatar, "Capacity of multi-antenna Gaussian channels," Europ. Trans. Telecomm., vol. 10, no. 6, 
pp. 585-595, Nov.-Dec. 1999. 

[5] P. Wolniansky, J. Foschini, G. Golden and R. Valenzuela, "V-BLAST: an architecture for realizing very 
high data rates over the rich-scattering wireless channel," Proc. of 1998 URSI Int. Symp. on Signals, 
Systems, and Electronics, pages 295-300, 1998. 

[6] G. Caire and G. Colavolpe, "On low-complexity space-time coding for quasi-static channels," IEEE 
Trans. on Inform. Theory, Vol. 49, No. 6, pp. 1400-1416, June 2003. 

[7] H. El Gamal and R. Hammons, "A new approach to layered space-time coding and signal processing," 
IEEE Trans. on Inform. Theory, Vol. 47, No. 6, pp. 2321-2334, June 2001. 

[8] S. Verdú and T. S. Han, "A General Formula for Channel Capacity," IEEE Trans. on Information 
Theory, vol. 40, no. 4, pp. 1147-1157, July 1994. 

[9] E. Biglieri, J. Proakis and S. Shamai, "Fading Channels: Information-Theoretic and Communications 
Aspects," IEEE Trans. on Inform. Theory, Vol. 44, No. 6, pp. 2619-2692, Oct. 1998. 

[10] M. O. Damen, H. El Gamal and G. Caire, "On maximum likelihood detection and the search for the 
closest lattice point," IEEE Trans. on Inform. Theory, Vol. 49, No. 10, pp. 2389-2402, Oct. 2003. 

[11] A. D. Murugan, H. El Gamal, M. O. Damen, and G. Caire, "A Unified Framework for Tree Search 
Decoding: Rediscovering the Sequential Decoder," IEEE Trans. on Inform. Theory, Vol. 52, No. 3, pp. 
933-953, Mar. 2006. 

[12] Sergio Verdú, Multiuser detection, Cambridge Univ. Press, 1998. 

[13] E. Biglieri and G. Taricco, "Transmission And Reception With Multiple Antennas: Theoretical 
Foundations," Foundations and Trends in Communications and Information Theory, Now Publishers Inc., 
2004. 






[14] A. Paulraj, R. Nabar, and D. Gore, Introduction to Space-Time Wireless Communications, Cambridge 
University Press, Cambridge UK, 2003. 

[15] D. Tse and P. Viswanath, Fundamentals of Wireless Communication, Cambridge University Press, 
Cambridge UK, 2005. 

[16] H. El Gamal, G. Caire and M. O. Damen, "Lattice Coding and Decoding Achieve the Optimal 
Diversity-Multiplexing Tradeoff of MIMO Channels," IEEE Trans. Inform. Theory, Vol. 50, No. 6, pp. 968-985, 
June 2004. 

[17] Y. Jiang and M. K. Varanasi, "Spatial Multiplexing Architectures with Jointly Designed Rate-Tailoring 
and Ordered BLAST Decoding - Part I: Diversity-Multiplexing Tradeoff Analysis," IEEE Trans. 
Wireless Communication, Vol. 7, No. 8, pp. 3252-3261, Aug. 2008. 

[18] Y. Jiang, M. K. Varanasi, and J. Li, "Performance analysis of ZF and MMSE equalizers for MIMO 
systems: A closer study in high SNR regime," IEEE Trans. Inform. Theory, submitted Aug. 2006, and 
accepted for publication. 

[19] A. Tajer, A. Nosratinia, and N. Al-Dhahir, "MMSE Infinite Length Symbol-by-Symbol Linear 
Equalization Achieves Full Diversity," in Proc. IEEE International Symposium on Information Theory (ISIT 
2007), Nice, France, June 24-29, 2007. 

[20] A. Hedayat and A. Nosratinia, "Outage and Diversity of Linear Receivers in Flat-Fading MIMO 
Channels," IEEE Trans. Signal Proc., Vol. 55, No. 12, pp. 5868-5873, Dec. 2007. 

[21] J. H. Winters, J. Salz and R. D. Gitlin, "The impact of antenna diversity on the capacity of wireless 
communication systems," IEEE Trans. Communications, Vol. 42, No. 234, pp. 1740 - 1751, 1994. 

[22] K. R. Kumar, G. Caire and A. L. Moustakas, "The Diversity-Multiplexing Tradeoff of Linear MIMO 
Receivers," IEEE Information Theory Workshop, ITW' 07, pp. 487-492, 2-6 Sept. 2007. 

[23] B. M. Hochwald, T. L. Marzetta, and V. Tarokh, "Multi-antenna channel hardening and its implications 
for rate feedback and scheduling," IEEE Trans. Inform. Theory, vol. 50, no. 9, pp. 1893-1909, Sept. 
2004. 

[24] A. L. Moustakas, S. H. Simon, and A. M. Sengupta, "MIMO capacity through correlated channels in 
the presence of correlated interferers and noise: A (not so) large N analysis," IEEE Trans. Inform. 
Theory, vol. 49, no. 10, pp. 2545-2561, Oct. 2003. 

[25] P. J. Smith and M. Shafi, "On the Gaussian approximation to the capacity of wireless MIMO systems," 
Proceedings, IEEE International Conference on Communications, p. 406, 2002. 






[26] W. Hachem, O. Khorunzhiy, P. Loubaton, J. Najim and L. Pastur, "A new approach for mutual 
information analysis of large dimensional multi-antenna channels," IEEE Trans. Inform. Theory, vol. 54, 
no. 9, pp. 3987-4004, Sept. 2008. 

[27] D. N. Tse and O. Zeitouni, "Linear multiuser receivers in random environments," IEEE Trans. Inform. 
Theory, vol. 46, no. 1, p. 171, Jan. 2000. 

[28] Y. C. Liang, G. Pan and Z. D. Bai, "Asymptotic Performance of MMSE Receivers for Large Systems 
Using Random Matrix Theory," IEEE Trans. Inform. Theory, vol. 53, no. 11, p. 4173, Nov. 2007. 

[29] M. Debbah et al., "MMSE analysis of certain large isometric random precoded systems," IEEE Trans. 
Inform. Theory, vol. 49, no. 5, p. 1293, May 2003. 

[30] K. Raj Kumar and G. Caire, "Space-Time Codes from Structured Lattices," Accepted for publication 
in IEEE Trans. Inform. Theory, 2008. 

[31] D. Champion, J.-C. Belfiore, G. Rekaya and E. Viterbo, "Partitionning the Golden Code: A framework 
to the design of Space-Time coded modulation," Canadian Workshop on Inform. Theory, Montreal, 
2005. 

[32] J. M. Mendel, "Tutorial on Higher-Order Statistics (Spectra) in Signal Processing and Systems Theory: 
Theoretical Results and Some Applications," Proceedings of the IEEE, vol. 79, No. 3, pp. 278-305, 1991. 

[33] D. N. Tse and S. Hanly, "Linear multiuser receivers: Effective interference, effective bandwidth and 
user capacity," IEEE Trans. Inform. Theory, vol. 45, pp. 641-657, Mar. 1999. 

[34] S. Verdú and S. Shamai, "Spectral efficiency of CDMA with random spreading," IEEE Trans. on 
Inform. Theory, Vol. 45, No. 2, pp. 622-640, March 1999. 

[35] S. Verdú and S. Shamai, "The Impact of Frequency-Flat Fading on the Spectral Efficiency of CDMA," 
IEEE Trans. on Inform. Theory, Vol. 47, No. 4, pp. 1302-1327, May 2001. 

[36] Novikov, Sov. Phys. JETP, vol. 20, p. 1290, 1965. 

[37] A. M. Tulino and S. Verdú, "Random matrix theory and wireless communications," Foundations and 
Trends in Communications and Information Theory, vol. 1, no. 1, pp. 1-182, 2004. 

[38] C. W. J. Beenakker, "Random-matrix theory of quantum transport," Rev. Mod. Phys., vol. 69, pp. 
731-808, 1997. 

[39] D. H. Politzer, "Random matrix description of the distribution of mesoscopic conductance," Phys. Rev. 
B, vol. 40, no. 17, p. 11917, 1989. 






